GTPASE ASSOCIATED PROTEINS



TECHNICAL FIELD 

This invention relates to nucleic acid and amino acid sequences of GTPase associated proteins 
and to the use of these sequences in the diagnosis, treatment, and prevention of cell proliferative, 
autoimmune/inflammatory, and immune system disorders. 



Guanine nucleotide binding proteins (GTP-binding proteins) participate in a wide range of
regulatory functions in all eukaryotic cells, including metabolism, cellular growth, differentiation,
signal transduction, cytoskeletal organization, and intracellular vesicle transport and secretion. In
higher organisms they are involved in signaling that regulates such processes as the immune response
(Aussel, C. et al. (1988) J. Immunol. 140:215-220), apoptosis, differentiation, and cell proliferation
including oncogenesis (Dhanasekaran, N. et al. (1998) Oncogene 17:1383-1394). Exchange of bound
GDP for GTP followed by hydrolysis of GTP to GDP provides the energy that enables GTP-binding
proteins to alter their conformation and interact with other cellular components. The superfamily of
GTP-binding proteins consists of several families and may be grouped as translational factors,
heterotrimeric GTP-binding proteins involved in transmembrane signaling processes (also called
G-proteins), and low molecular weight GTP-binding proteins including the proto-oncogene Ras proteins
and products of rab, rap, rho, rac, smg21, smg25, YPT, SEC4, and ARF genes, and tubulins (Kaziro,
Y. et al. (1991) Ann. Rev. Biochem. 60:349-400). In all cases, the GTPase activity is regulated
through interactions with other proteins.

GTP-binding proteins involved in protein biosynthesis include initiation factor 2 (IF-2),
elongation factor Tu (EF-Tu), and elongation factor G (EF-G), observed in prokaryotes; and initiation
factor 2 (eIF-2), elongation factor 1α (EF-1α), and elongation factor 2 (EF-2), observed in eukaryotes
(Kaziro, supra). IF-2 promotes the GTP-dependent binding of the tRNA to the small subunit of the
ribosome, the step that initiates protein translation. Similarly, elongation factors promote the binding
of tRNA and GTP and the displacement of GDP after hydrolysis as protein biosynthesis proceeds.

Heterotrimeric GTP-binding proteins are composed of three subunits (α, β, and γ) which, in their
inactive conformation, associate as a trimer at the inner face of the plasma membrane. Gα binds GDP
or GTP and contains the GTPase activity. The βγ complex enhances binding of Gα to a receptor.
Gγ is necessary for the folding and activity of Gβ (Neer, E.J. et al. (1994) Nature 371:297-300).
Multiple homologs of each subunit have been identified in mammalian tissues, and different
combinations of subunits have specific functions and tissue specificities (Spiegel, A.M. (1997) J.
Inher. Metab. Dis. 20:113-121). G protein activity is triggered by seven-transmembrane cell surface
receptors (G-protein coupled receptors) which respond to lipid analogs, amino acids and their
derivatives, peptides, cytokines, and specialized stimuli such as light, taste, and odor. Activation of
the receptor by its stimulus causes the replacement of the G protein-bound GDP with GTP. Gα-GTP
dissociates from the receptor/βγ complex, and each of these separated components can interact with
and regulate downstream effectors. The signaling stops when Gα hydrolyzes its bound GTP to GDP
and reassociates with the βγ complex (Neer, supra).

The alpha subunits of heterotrimeric G proteins can be divided into four distinct classes. The
α-s class is sensitive to ADP-ribosylation by pertussis toxin, which uncouples the receptor:G-protein
interaction. This uncoupling blocks signal transduction to receptors that decrease cAMP levels which
normally regulate ion channels and activate phospholipases. The inhibitory α-i class is also
susceptible to modification by pertussis toxin, which prevents α-i from lowering cAMP levels. Two
novel classes of α subunits refractory to pertussis toxin modification are α-q, which activates
phospholipase C, and α-12, which has sequence homology with the Drosophila gene concertina and
may contribute to the regulation of embryonic development (Simon, M.I. (1991) Science
252:802-808).

The mammalian Gβ and Gγ subunits, each about 340 amino acids long, share more than 80%
homology. The Gβ subunit (also called transducin) contains seven repeating units, each about 43
amino acids long. The activity of both subunits may be regulated by other proteins such as
calmodulin and phosducin or the neural protein GAP 43 (D. Clapham and E. Neer, 1993, Nature
365:403-406). The β and γ subunits are tightly associated. The β subunit sequences are highly
conserved between species, implying that they perform a fundamentally important role in the
organization and function of G-protein linked systems (Van der Voorn, L. (1992) FEBS Lett.
307(2):131-134). They contain seven tandem repeats of the WD-repeat sequence motif, a motif found in
many proteins with regulatory functions. WD-repeat proteins contain from four to eight copies of a
loosely conserved repeat of approximately 40 amino acids which participates in protein-protein
interactions. Mutations and variant expression of β transducin proteins are linked with various
disorders. Mutations in LIS1, a subunit of the human platelet activating factor acetylhydrolase, cause
Miller-Dieker lissencephaly. RACK1 binds activated protein kinase C, and RbAp48 binds
retinoblastoma protein. CstF is required for polyadenylation of mammalian pre-mRNA in vitro and
associates with subunits of cleavage-stimulating factor. Defects in the regulation of β-catenin
contribute to the neoplastic transformation of human cells. The WD40 repeats of the human F-box
protein βTrCP mediate binding to β-catenin, thus regulating the targeted degradation of β-catenin by

WO 00/31263 PCT/US99/28013 
ubiquitin ligase (Neer, supra; Hart, M. et al. (1999) Curr. Biol. 9:207-210). The γ subunit primary
structures are more variable than those of the β subunits. They are often post-translationally modified
by isoprenylation and carboxyl-methylation of a cysteine residue four amino acids from the
C-terminus; this appears to be necessary for the interaction of the βγ subunit with the membrane and
with other GTP-binding proteins. The βγ subunit has been shown to modulate the activity of
isoforms of adenylyl cyclase, phospholipase C, and some ion channels. It is involved in receptor
phosphorylation via specific kinases, and has been implicated in the p21ras-dependent activation of
the MAP kinase cascade and the recognition of specific receptors by GTP-binding proteins (Clapham
and Neer, supra).

G-proteins interact with a variety of effectors including adenylyl cyclase (Clapham and Neer,
supra). The signaling pathway mediated by cAMP is mitogenic in hormone-dependent endocrine
tissues such as adrenal cortex, thyroid, ovary, pituitary, and testes. Cancers in these tissues have been
related to a mutationally activated form of a Gαs known as the gsp (Gs protein) oncogene
(Dhanasekaran, supra). Another effector is phosducin, a retinal phosphoprotein, which forms a
specific complex with retinal Gβ and Gγ (Gβγ) and modulates the ability of Gβγ to interact with
retinal Gα (Clapham and Neer, supra).

Irregularities in the GTP-binding protein signaling cascade may result in abnormal activation
of leukocytes and lymphocytes, leading to the tissue damage and destruction seen in many
inflammatory and autoimmune diseases such as rheumatoid arthritis, biliary cirrhosis, hemolytic
anemia, lupus erythematosus, and thyroiditis. Abnormal cell proliferation, including cyclic AMP
stimulation of brain, thyroid, adrenal, and gonadal tissue proliferation, is regulated by G proteins.
Mutations in Gα subunits have been found in growth-hormone-secreting pituitary somatotroph
tumors, hyperfunctioning thyroid adenomas, and ovarian and adrenal neoplasms (Meij, J.T.A. (1996)
Mol. Cell. Biochem. 157:31-38; Aussel, supra).

LMW GTP-binding proteins are GTPases which regulate cell growth, cell cycle control,
protein secretion, and intracellular vesicle interaction. They consist of single polypeptides which, like
the alpha subunit of the heterotrimeric GTP-binding proteins, are able to bind to and hydrolyze GTP,
thus cycling between an inactive and an active state. LMW GTP-binding proteins respond to
extracellular signals from receptors and activating proteins by transducing mitogenic signals involved
in various cell functions. The binding and hydrolysis of GTP regulates the response of LMW
GTP-binding proteins and acts as an energy source during this process (Bokoch, G. M. and Der, C. J.
(1993) FASEB J. 7:750-759).

At least sixty members of the LMW GTP-binding protein superfamily have been identified
and are currently grouped into the ras, rho, arf, sar1, ran, and rab subfamilies. Activated ras genes
were initially found in human cancers, and subsequent studies confirmed that ras function is critical in
determining whether cells continue to grow or become differentiated. Ras1 and Ras2 proteins
stimulate adenylate cyclase (Kaziro, supra), affecting a broad array of cellular processes. Stimulation
of cell surface receptors activates Ras which, in turn, activates cytoplasmic kinases. These kinases
translocate to the nucleus and activate key transcription factors that control gene expression and
protein synthesis (Barbacid, M. (1987) Ann. Rev. Biochem. 56:779-827; Treisman, R. (1994) Curr.
Opin. Genet. Dev. 4:96-98). Other members of the LMW GTP-binding protein superfamily have roles
in signal transduction that vary with the function of the activated genes and the locations of the
GTP-binding proteins that initiate the activity. Rho GTP-binding proteins control signal transduction
pathways that link growth factor receptors to actin polymerization, which is necessary for normal
cellular growth and division. The rab, arf, and sar1 families of proteins control the translocation of
vesicles to and from membranes for protein processing, localization, and secretion. Vesicle- and
target-specific identifiers (v-SNAREs and t-SNAREs) bind to each other and dock the vesicle to the
acceptor membrane. The budding process is regulated by the closely related ADP ribosylation factors
(ARFs) and SAR proteins, while rab proteins allow assembly of SNARE complexes and may play a
role in removal of defective complexes (J. Rothman and F. Wieland (1996) Science 272:227-234).
Ran GTP-binding proteins are located in the nucleus of cells and have a key role in nuclear protein
import, the control of DNA synthesis, and cell-cycle progression (Hall, A. (1990) Science
249:635-640; Barbacid, M. (1987) Ann. Rev. Biochem. 56:779-827; Ktistakis, N. (1998) BioEssays
20:495-504; and Sasaki, T. and Takai, Y. (1998) Biochem. Biophys. Res. Commun. 245:641-645).

The cycling of LMW GTP-binding proteins between the GTP-bound active form and the
GDP-bound inactive form is regulated by additional proteins. Guanosine nucleotide exchange factors
(GEFs) increase the rate of nucleotide dissociation by several orders of magnitude, thus facilitating
release of GDP and loading with GTP. The best characterized is the mammalian homologue of the
Drosophila Son-of-Sevenless protein. Certain Ras-family proteins are also regulated by guanine
nucleotide dissociation inhibitors (GDIs), which inhibit GDP dissociation. The intrinsic rate of GTP
hydrolysis of the LMW GTP-binding proteins is typically very slow, but it can be stimulated by
several orders of magnitude by GTPase-activating proteins (GAPs) (Geyer, M. and Wittinghofer, A.
(1997) Curr. Opin. Struct. Biol. 7:786-792). Both GEF and GAP activity may be controlled in
response to extracellular stimuli and modulated by accessory proteins such as RalBP1 and POB1.
Mutant Ras-family proteins, which bind but cannot hydrolyze GTP, are permanently activated, and
cause cell proliferation or cancer, as do GEFs that inappropriately activate LMW GTP-binding
proteins, such as the human oncogene NET1, a Rho-GEF (Drivas, G. T. et al. (1990) Mol. Cell. Biol.
10:1793-1798; Alberts, A. S. and Treisman, R. (1998) EMBO J. 14:4075-4085).

A novel group of GTP-binding proteins is the GTP1/OBG family, which are found in species
ranging from bacteria to yeast to humans. These proteins contain characteristic GTP-binding motifs
and are similar to one another but do not show sequence homology to other GTP-binding proteins.
The exact functions of these proteins are as yet uncertain, but they have been shown to be important
for regulation of cell differentiation and development (Okamoto, S. and Ochi, K. (1998) Mol.
Microbiol. 30:107-119; Sazaka, T. et al. (1992) Biochem. Biophys. Res. Commun. 189:363-370).

The discovery of new GTPase associated proteins and the polynucleotides encoding them
satisfies a need in the art by providing new compositions which are useful in the diagnosis,
prevention, and treatment of cell proliferative, autoimmune/inflammatory, and immune system
disorders.

SUMMARY OF THE INVENTION 

The invention features substantially purified polypeptides, GTPase associated proteins,
referred to collectively as "GTPAP" and individually as "GTPAP-1," "GTPAP-2," "GTPAP-3,"
"GTPAP-4," "GTPAP-5," "GTPAP-6," "GTPAP-7," "GTPAP-8," "GTPAP-9," "GTPAP-10,"
"GTPAP-11," "GTPAP-12," "GTPAP-13," "GTPAP-14," "GTPAP-15," "GTPAP-16," "GTPAP-17,"
"GTPAP-18," "GTPAP-19," "GTPAP-20," "GTPAP-21," "GTPAP-22," "GTPAP-23," "GTPAP-24,"
"GTPAP-25," "GTPAP-26," "GTPAP-27," "GTPAP-28," and "GTPAP-29." In one aspect, the
invention provides a substantially purified polypeptide comprising an amino acid sequence selected
from the group consisting of SEQ ID NO:1-29 and fragments thereof. The invention also includes a
polypeptide comprising an amino acid sequence that differs by one or more conservative amino acid
substitutions from an amino acid sequence selected from the group consisting of SEQ ID NO:1-29.

The invention further provides a substantially purified variant having at least 90% amino acid
identity to at least one of the amino acid sequences selected from the group consisting of SEQ ID
NO:1-29 and fragments thereof. The invention also provides an isolated and purified polynucleotide
encoding the polypeptide comprising an amino acid sequence selected from the group consisting of
SEQ ID NO:1-29 and fragments thereof. The invention also includes an isolated and purified
polynucleotide variant having at least 90% polynucleotide sequence identity to the polynucleotide
encoding the polypeptide comprising an amino acid sequence selected from the group consisting of
SEQ ID NO:1-29 and fragments thereof.
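
As an illustration of what a 90% identity threshold means in practice, the following is a minimal Python sketch. It assumes two sequences that have already been aligned to equal length, with "-" marking gaps. The function names `percent_identity` and `is_variant`, the choice to divide by the number of aligned columns, and the example sequences are assumptions made here for illustration only; they are not the alignment method or scoring convention used in this specification.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns in which both sequences carry the same residue.

    Assumes the inputs are pre-aligned (equal length, '-' for gaps).
    Columns where both sequences have a gap are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == "-" and b == "-")]
    if not columns:
        return 0.0
    # A column counts as a match only if the residues are equal and not gaps.
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def is_variant(aligned_a: str, aligned_b: str, threshold: float = 90.0) -> bool:
    """True if the two aligned sequences meet the identity threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# Hypothetical 10-residue example: one mismatch gives 90% identity.
reference = "MKTAYIAKQR"
candidate = "MKTAYIAKQK"
print(percent_identity(reference, candidate))  # 90.0
print(is_variant(reference, candidate))        # True
```

Real comparisons depend on the alignment algorithm and its gap penalties, so the value obtained for a given pair of sequences can differ between tools.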

Additionally, the invention provides an isolated and purified polynucleotide which hybridizes
under stringent conditions to the polynucleotide encoding the polypeptide comprising an amino acid
sequence selected from the group consisting of SEQ ID NO:1-29 and fragments thereof. The
invention also provides an isolated and purified polynucleotide having a sequence which is
complementary to the polynucleotide encoding the polypeptide comprising the amino acid sequence
selected from the group consisting of SEQ ID NO:1-29 and fragments thereof.

The invention also provides a method for detecting a polynucleotide in a sample containing
nucleic acids, the method comprising the steps of: (a) hybridizing the complement of the
polynucleotide sequence to at least one of the polynucleotides of the sample, thereby forming a
hybridization complex; and (b) detecting the hybridization complex, wherein the presence of the
hybridization complex correlates with the presence of a polynucleotide in the sample. In one aspect,
the method further comprises amplifying the polynucleotide prior to hybridization.

The invention also provides an isolated and purified polynucleotide comprising a
polynucleotide sequence selected from the group consisting of SEQ ID NO:30-58 and fragments
thereof. The invention further provides an isolated and purified polynucleotide variant having at least
90% polynucleotide sequence identity to the polynucleotide sequence selected from the group
consisting of SEQ ID NO:30-58 and fragments thereof. The invention also provides an isolated and
purified polynucleotide having a sequence which is complementary to the polynucleotide comprising
a polynucleotide sequence selected from the group consisting of SEQ ID NO:30-58 and fragments
thereof.

The invention further provides an expression vector containing at least a fragment of the
polynucleotide encoding the polypeptide comprising an amino acid sequence selected from the group
consisting of SEQ ID NO:1-29. In another aspect, the expression vector is contained within a host
cell.

The invention also provides a method for producing a polypeptide, the method comprising the
steps of: (a) culturing the host cell containing an expression vector containing a polynucleotide of the
invention under conditions suitable for the expression of the polypeptide; and (b) recovering the
polypeptide from the host cell culture.

The invention also provides a pharmaceutical composition comprising a substantially purified
polypeptide having the amino acid sequence selected from the group consisting of SEQ ID NO:1-29
and fragments thereof, in conjunction with a suitable pharmaceutical carrier.

The invention further includes a purified antibody which binds to a polypeptide selected from
the group consisting of SEQ ID NO:1-29 and fragments thereof. The invention also provides a
purified agonist and a purified antagonist to the polypeptide.

The invention also provides a method for treating or preventing a disorder associated with
decreased expression or activity of GTPAP, the method comprising administering to a subject in need
of such treatment an effective amount of a pharmaceutical composition comprising a substantially
purified polypeptide having the amino acid sequence selected from the group consisting of SEQ ID
NO:1-29 and fragments thereof, in conjunction with a suitable pharmaceutical carrier.

The invention also provides a method for treating or preventing a disorder associated with
increased expression or activity of GTPAP, the method comprising administering to a subject in need
of such treatment an effective amount of an antagonist of a polypeptide having an amino acid
sequence selected from the group consisting of SEQ ID NO:1-29 and fragments thereof.

BRIEF DESCRIPTION OF THE TABLES 

Table 1 shows polypeptide and nucleotide sequence identification numbers (SEQ ID NOs),
clone identification numbers (clone IDs), cDNA libraries, and cDNA fragments used to assemble
full-length sequences encoding GTPAP.

Table 2 shows features of each polypeptide sequence, including potential motifs, homologous
sequences, and methods, algorithms, and searchable databases used for analysis of GTPAP.

Table 3 shows selected fragments of each nucleic acid sequence; the tissue-specific
expression patterns of each nucleic acid sequence as determined by northern analysis; diseases,
disorders, or conditions associated with these tissues; and the vector into which each cDNA was
cloned.

Table 4 describes the tissues used to construct the cDNA libraries from which cDNA clones
encoding GTPAP were isolated.

Table 5 shows the tools, programs, and algorithms used to analyze GTPAP, along with
applicable descriptions, references, and threshold parameters.

DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleotide sequences, and methods are described, it is understood
that this invention is not limited to the particular machines, materials and methods described, as these
may vary. It is also to be understood that the terminology used herein is for the purpose of describing
particular embodiments only, and is not intended to limit the scope of the present invention which will
be limited only by the appended claims.

It must be noted that as used herein and in the appended claims, the singular forms "a," "an,"
and "the" include plural reference unless the context clearly dictates otherwise. Thus, for example, a
reference to "a host cell" includes a plurality of such host cells, and a reference to "an antibody" is a
reference to one or more antibodies and equivalents thereof known to those skilled in the art, and so
forth.

Unless defined otherwise, all technical and scientific terms used herein have the same
meanings as commonly understood by one of ordinary skill in the art to which this invention belongs.
Although any machines, materials, and methods similar or equivalent to those described herein can be
used to practice or test the present invention, the preferred machines, materials and methods are now
described. All publications mentioned herein are cited for the purpose of describing and disclosing
the cell lines, protocols, reagents and vectors which are reported in the publications and which might
be used in connection with the invention. Nothing herein is to be construed as an admission that the
invention is not entitled to antedate such disclosure by virtue of prior invention.

DEFINITIONS

"GTPAP" refers to the amino acid sequences of substantially purified GTPAP obtained from 
any species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and
human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant.

The term "agonist" refers to a molecule which intensifies or mimics the biological activity of
GTPAP. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other
compound or composition which modulates the activity of GTPAP either by directly interacting with
GTPAP or by acting on components of the biological pathway in which GTPAP participates.

An "allelic variant" is an alternative form of the gene encoding GTPAP. Allelic variants may 
result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in 
polypeptides whose structure or function may or may not be altered. A gene may have none, one, or 
many allelic variants of its naturally occurring form. Common mutational changes which give rise to 
allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides.
Each of these types of changes may occur alone, or in combination with the others, one or more times 
in a given sequence. 

"Altered" nucleic acid sequences encoding GTPAP include those sequences with deletions, 
insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as GTPAP or a
polypeptide with at least one functional characteristic of GTPAP. Included within this definition are
polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe
of the polynucleotide encoding GTPAP, and improper or unexpected hybridization to allelic variants,
with a locus other than the normal chromosomal locus for the polynucleotide sequence encoding
GTPAP. The encoded protein may also be "altered," and may contain deletions, insertions, or
substitutions of amino acid residues which produce a silent change and result in a functionally
equivalent GTPAP. Deliberate amino acid substitutions may be made on the basis of similarity in
polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the
residues, as long as the biological or immunological activity of GTPAP is retained. For example,
negatively charged amino acids may include aspartic acid and glutamic acid, and positively charged
amino acids may include lysine and arginine. Amino acids with uncharged polar side chains having
similar hydrophilicity values may include: asparagine and glutamine; and serine and threonine.
Amino acids with uncharged side chains having similar hydrophilicity values may include: leucine,
isoleucine, and valine; glycine and alanine; and phenylalanine and tyrosine.

The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide,
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or synthetic
molecules. Where "amino acid sequence" is recited to refer to an amino acid sequence of a naturally
occurring protein molecule, "amino acid sequence" and like terms are not meant to limit the amino
acid sequence to the complete native amino acid sequence associated with the recited protein
molecule.

"Amplification" relates to the production of additional copies of a nucleic acid sequence. 
Amplification is generally carried out using polymerase chain reaction (PCR) technologies well 
known in the art. 

The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity
of GTPAP. Antagonists may include proteins such as antibodies, nucleic acids, carbohydrates, small
molecules, or any other compound or composition which modulates the activity of GTPAP either by
directly interacting with GTPAP or by acting on components of the biological pathway in which
GTPAP participates.

The term "antibody" refers to intact immunoglobulin molecules as well as to fragments
thereof, such as Fab, F(ab')2, and Fv fragments, which are capable of binding an epitopic determinant.
Antibodies that bind GTPAP polypeptides can be prepared using intact polypeptides or using
fragments containing small peptides of interest as the immunizing antigen. The polypeptide or
oligopeptide used to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the
translation of RNA, or synthesized chemically, and can be conjugated to a carrier protein if desired.
Commonly used carriers that are chemically coupled to peptides include bovine serum albumin,
thyroglobulin, and keyhole limpet hemocyanin (KLH). The coupled peptide is then used to immunize
the animal.

The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that
makes contact with a particular antibody. When a protein or a fragment of a protein is used to
immunize a host animal, numerous regions of the protein may induce the production of antibodies
which bind specifically to antigenic determinants (particular regions or three-dimensional structures
on the protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen
used to elicit the immune response) for binding to an antibody.

The term "antisense" refers to any composition containing a nucleic acid sequence which is
complementary to the "sense" strand of a specific nucleic acid sequence. Antisense molecules may be
produced by any method including synthesis or transcription. Once introduced into a cell, the
complementary nucleotides combine with natural sequences produced by the cell to form duplexes
and to block either transcription or translation. The designation "negative" or "minus" can refer to the
antisense strand, and the designation "positive" or "plus" can refer to the sense strand.

The term "biologically active" refers to a protein having structural, regulatory, or biochemical 
functions of a naturally occurring molecule. Likewise, "immunologically active" refers to the 
capability of the natural, recombinant, or synthetic GTPAP, or of any oligopeptide thereof, to induce a 
specific immune response in appropriate animals or cells and to bind with specific antibodies. 

The terms "complementary" and "complementarity" refer to the natural binding of
polynucleotides by base pairing. For example, the sequence "5' A-G-T 3'" bonds to the
complementary sequence "3' T-C-A 5'." Complementarity between two single-stranded molecules
may be "partial," such that only some of the nucleic acids bind, or it may be "complete," such that
total complementarity exists between the single stranded molecules. The degree of complementarity
between nucleic acid strands has significant effects on the efficiency and strength of the hybridization
between the nucleic acid strands. This is of particular importance in amplification reactions, which
depend upon binding between nucleic acid strands, and in the design and use of peptide nucleic acid
(PNA) molecules.
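
The base-pairing rule in the definition above can be made concrete with a short Python sketch. It reproduces the "5' A-G-T 3'" and "3' T-C-A 5'" example and shows one possible way to express partial versus complete complementarity as a fraction of paired positions. The function names and the fraction-based measure are assumptions introduced here for illustration; only DNA bases A, C, G, and T are handled.

```python
# Watson-Crick pairing for DNA bases.
PAIRS = {"A": "T", "T": "A", "G": "C", "C": "G"}


def complement_strand(seq_5_to_3: str) -> str:
    """Return the complementary strand, written 3'->5' position by position."""
    return "".join(PAIRS[base] for base in seq_5_to_3.upper())


def reverse_complement(seq_5_to_3: str) -> str:
    """Return the complementary strand written in the conventional 5'->3' direction."""
    return complement_strand(seq_5_to_3)[::-1]


def fraction_complementary(strand_5_to_3: str, other_3_to_5: str) -> float:
    """Fraction of positions that base-pair between two antiparallel strands.

    1.0 corresponds to "complete" complementarity; values between 0 and 1
    correspond to "partial" complementarity.
    """
    if not strand_5_to_3 or len(strand_5_to_3) != len(other_3_to_5):
        raise ValueError("strands must be non-empty and of equal length")
    paired = sum(1 for a, b in zip(strand_5_to_3.upper(), other_3_to_5.upper())
                 if PAIRS.get(a) == b)
    return paired / len(strand_5_to_3)


print(complement_strand("AGT"))              # TCA  (3' T-C-A 5')
print(reverse_complement("AGT"))             # ACT  (same strand written 5'->3')
print(fraction_complementary("AGT", "TCA"))  # 1.0  (complete)
```

A simple paired fraction ignores effects such as mismatch position and base composition, which also influence hybridization strength.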

A "composition comprising a given polynucleotide sequence" and a "composition comprising 
a given amino acid sequence" refer broadly to any composition containing the given polynucleotide or
amino acid sequence. The composition may comprise a dry formulation or an aqueous solution.
Compositions comprising polynucleotide sequences encoding GTPAP or fragments of GTPAP may
be employed as hybridization probes. The probes may be stored in freeze-dried form and may be
associated with a stabilizing agent such as a carbohydrate. In hybridizations, the probe may be
deployed in an aqueous solution containing salts (e.g., NaCl), detergents (e.g., sodium dodecyl
sulfate; SDS), and other components (e.g., Denhardt's solution, dry milk, salmon sperm DNA, etc.).

"Consensus sequence" refers to a nucleic acid sequence which has been resequenced to 
resolve uncalled bases, extended using the XL-PCR kit (Perkin-Elmer, Norwalk CT) in the 5' and/or 
the 3' direction, and resequenced, or which has been assembled from the overlapping sequences of 
one or more Incyte Clones and, in some cases, one or more public domain ESTs, using a computer
program for fragment assembly, such as the GELVIEW fragment assembly system (GCG, Madison
WI). Some sequences have been both extended and assembled to produce the consensus sequence.

"Conservative amino acid substitutions" are those substitutions that, when made, least 
interfere with the properties of the original protein, i.e., the structure and especially the function of the
protein are conserved and not significantly changed by such substitutions. The table below shows
amino acids which may be substituted for an original amino acid in a protein and which are regarded
as conservative amino acid substitutions.



Original Residue      Conservative Substitution
Ala                   Gly, Ser
Arg                   [illegible]
Asn                   Asp, Gln, His
Asp                   Asn, Glu
Cys                   [illegible]
Gln                   Asn, Glu, His
Glu                   Asp, Gln, His
Gly                   [illegible]
His                   Asn, Arg, Gln, Glu
Ile                   Leu, Val
Leu                   Ile, Val
Lys                   Arg, Gln, Glu
Met                   Leu, Ile
Phe                   His, Met, Leu, Trp, Tyr
Ser                   Cys, Thr
Thr                   Ser, Val
Trp                   Phe, Tyr
Tyr                   His, Phe, Trp
Val                   Ile, Leu, Thr



Conservative amino acid substitutions generally maintain (a) the structure of the polypeptide
backbone in the area of the substitution, for example, as a beta sheet or alpha helical conformation,
(b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or (c) the bulk of
the side chain.
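
The substitution table above lends itself to a simple lookup. The Python sketch below encodes only those rows whose entries are legible in the table; the rows for Arg, Cys, and Gly are omitted because their substitutions cannot be read from it. The names `CONSERVATIVE` and `is_conservative` are hypothetical and introduced here for illustration. Note that the table is directional, so the lookup is keyed on the original residue.

```python
# Conservative substitutions keyed by original residue (three-letter codes).
# Rows for Arg, Cys, and Gly are intentionally left out (entries not legible).
CONSERVATIVE = {
    "Ala": {"Gly", "Ser"},
    "Asn": {"Asp", "Gln", "His"},
    "Asp": {"Asn", "Glu"},
    "Gln": {"Asn", "Glu", "His"},
    "Glu": {"Asp", "Gln", "His"},
    "His": {"Asn", "Arg", "Gln", "Glu"},
    "Ile": {"Leu", "Val"},
    "Leu": {"Ile", "Val"},
    "Lys": {"Arg", "Gln", "Glu"},
    "Met": {"Leu", "Ile"},
    "Phe": {"His", "Met", "Leu", "Trp", "Tyr"},
    "Ser": {"Cys", "Thr"},
    "Thr": {"Ser", "Val"},
    "Trp": {"Phe", "Tyr"},
    "Tyr": {"His", "Phe", "Trp"},
    "Val": {"Ile", "Leu", "Thr"},
}


def is_conservative(original: str, replacement: str) -> bool:
    """True if the table lists `replacement` as conservative for `original`.

    Returns False for residues whose rows are not encoded above.
    """
    return replacement in CONSERVATIVE.get(original, set())


print(is_conservative("Ala", "Gly"))  # True
print(is_conservative("Ala", "Trp"))  # False
```

A set per residue keeps membership checks simple and makes it easy to add the missing rows once their contents are confirmed against a clean copy of the table.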

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
absence of one or more amino acid residues or nucleotides.

The term "derivative" refers to the chemical modification of a polypeptide sequence, or a 
polynucleotide sequence. Chemical modifications of a polynucleotide sequence can include, for 
example, replacement of hydrogen by an alkyl, acyl, hydroxyl, or amino group. A derivative 
polynucleotide encodes a polypeptide which retains at least one biological or immunological function 
of the natural molecule. A derivative polypeptide is one modified by glycosylation, pegylation, or any
similar process that retains at least one biological or immunological function of the polypeptide from 
which it was derived. 

A "fragment" is a unique portion of GTPAP or the polynucleotide encoding GTPAP which is 
identical in sequence to but shorter in length than the parent sequence. A fragment may comprise up 
to the entire length of the defined sequence, minus one nucleotide/amino acid residue. For example, a
fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues. A fragment 



WO 00/31263 PCT/US99/28013 

used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at least 5, 10, 15,
20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or amino acid residues
in length. Fragments may be preferentially selected from certain regions of a molecule. For example,
a polypeptide fragment may comprise a certain length of contiguous amino acids selected from the
first 250 or 500 amino acids (or first 25% or 50% of a polypeptide) as shown in a certain defined
sequence. Clearly these lengths are exemplary, and any length that is supported by the specification, 
including the Sequence Listing, tables, and figures, may be encompassed by the present embodiments. 
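
The definition of a fragment as a contiguous portion shorter than the parent sequence can be illustrated with a short Python sketch. The function name fragments and the default minimum length of 5 are assumptions chosen for illustration; any supported length could be used.

```python
def fragments(sequence, min_len=5):
    """Yield every contiguous fragment of `sequence` that is at least
    `min_len` residues long and strictly shorter than the parent."""
    n = len(sequence)
    # The longest fragment is the parent minus one residue, so lengths
    # run from min_len up to n - 1.
    for length in range(min_len, n):
        for start in range(n - length + 1):
            yield sequence[start:start + length]
```

For a 6-residue parent and a minimum length of 5, only the two 5-residue windows qualify; the full-length sequence is excluded by construction.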

A fragment of SEQ ID NO:30-58 comprises a region of unique polynucleotide sequence that
specifically identifies SEQ ID NO:30-58, for example, as distinct from any other sequence in the
same genome. A fragment of SEQ ID NO:30-58 is useful, for example, in hybridization and
amplification technologies and in analogous methods that distinguish SEQ ID NO:30-58 from related
polynucleotide sequences. The precise length of a fragment of SEQ ID NO:30-58 and the region of
SEQ ID NO:30-58 to which the fragment corresponds are routinely determinable by one of ordinary
skill in the art based on the intended purpose for the fragment.
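
As a rough illustration of what "a region of unique polynucleotide sequence" means in practice, the following Python sketch lists the windows of a target sequence that do not occur in any of a set of comparison sequences. The function name unique_kmers is hypothetical, and the sketch makes simplifying assumptions: it compares exact substrings on one strand only and does not consider reverse complements or near matches.

```python
def unique_kmers(target, others, k):
    """Return the distinct length-k windows of `target`, in order of first
    appearance, that are not an exact substring of any sequence in `others`."""
    found = []
    for i in range(len(target) - k + 1):
        kmer = target[i:i + k]
        if kmer in found:
            continue  # already reported this window
        if not any(kmer in other for other in others):
            found.append(kmer)
    return found
```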

A fragment of SEQ ID NO:1-29 is encoded by a fragment of SEQ ID NO:30-58. A fragment
of SEQ ID NO:1-29 comprises a region of unique amino acid sequence that specifically identifies
SEQ ID NO:1-29. For example, a fragment of SEQ ID NO:1-29 is useful as an immunogenic peptide
for the development of antibodies that specifically recognize SEQ ID NO:1-29. The precise length of
a fragment of SEQ ID NO:1-29 and the region of SEQ ID NO:1-29 to which the fragment
corresponds are routinely determinable by one of ordinary skill in the art based on the intended
purpose for the fragment.

The term "similarity" refers to a degree of complementarity. There may be partial similarity
or complete similarity. The word "identity" may substitute for the word "similarity." A partially
complementary sequence that at least partially inhibits an identical sequence from hybridizing to a
target nucleic acid is referred to as "substantially similar." The inhibition of hybridization of the
completely complementary sequence to the target sequence may be examined using a hybridization
assay (Southern or northern blot, solution hybridization, and the like) under conditions of reduced
stringency. A substantially similar sequence or hybridization probe will compete for and inhibit the
binding of a completely similar (identical) sequence to the target sequence under conditions of
reduced stringency. This is not to say that conditions of reduced stringency are such that non-specific
binding is permitted, as reduced stringency conditions require that the binding of two sequences to
one another be a specific (i.e., a selective) interaction. The absence of non-specific binding may be
tested by the use of a second target sequence which lacks even a partial degree of complementarity
(e.g., less than about 30% similarity or identity). In the absence of non-specific binding, the
substantially similar sequence or probe will not hybridize to the second non-complementary target
sequence.

The phrases "percent identity" and "% identity," as applied to polynucleotide sequences, refer 
to the percentage of residue matches between at least two polynucleotide sequences aligned using a 
standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, gaps
in the sequences being compared in order to optimize alignment between two sequences, and 
therefore achieve a more meaningful comparison of the two sequences. 
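
Once two sequences have been aligned, counting residue matches is straightforward. The Python sketch below assumes the alignment has already been produced by some alignment program and is supplied as two equal-length strings with "-" marking gaps. The function name percent_identity is hypothetical, and dividing by the number of alignment columns is one possible convention; alignment programs differ in the denominator they use.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percentage of alignment columns in which both rows carry the same
    residue. Gap columns never count as matches."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap
    )
    return 100.0 * matches / len(aligned_a)
```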

Percent identity between polynucleotide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e 
sequence alignment program. This program is part of the LASERGENE software package, a suite of
molecular biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in
Higgins, D.G. and P.M. Sharp (1989) CABIOS 5:151-153 and in Higgins, D.G. et al. (1992) CABIOS
8:189-191. For pairwise alignments of polynucleotide sequences, the default parameters are set as
follows: Ktuple=2, gap penalty=5, window=4, and "diagonals saved"=4. The "weighted" residue
weight table is selected as the default. Percent identity is reported by CLUSTAL V as the "percent
similarity" between aligned polynucleotide sequence pairs. 

Alternatively, a suite of commonly used and freely available sequence comparison algorithms
is provided by the National Center for Biotechnology Information (NCBI) Basic Local Alignment
Search Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403-410), which is available from
several sources, including the NCBI, Bethesda, MD, and on the Internet at
http://www.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite includes various sequence analysis
programs including "blastn," that is used to align a known polynucleotide sequence with other
polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 2
Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2
Sequences" can be accessed and used interactively at http://www.ncbi.nlm.nih.gov/gorf/bl2.html. The
"BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST
programs are commonly used with gap and other parameters set to default settings. For example, to
compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version
2.0.9 (May-07-1999) set at default parameters. Such default parameters may be, for example:
Matrix: BLOSUM62
Reward for match: 1
Penalty for mismatch: -2
Open Gap: 5 and Extension Gap: 2 penalties
Gap x drop-off: 50
Expect: 10
Word Size: 11
Filter: on
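
To make the role of the match reward, mismatch penalty, and gap penalties concrete, the following Python sketch scores an alignment that has already been computed. It is not the BLAST algorithm itself and performs no search. The function name alignment_score is hypothetical, and the sketch assumes that a gap run of length k costs the open penalty once plus k times the extension penalty.

```python
def alignment_score(row_a, row_b, reward=1, penalty=-2,
                    gap_open=5, gap_extend=2, gap="-"):
    """Score a precomputed pairwise nucleotide alignment.

    Assumption: a run of k gap characters in one row costs
    gap_open + k * gap_extend.
    """
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have equal length")
    score = 0
    open_row = None  # which row ("a" or "b") holds the current gap run
    for x, y in zip(row_a, row_b):
        if x == gap or y == gap:
            row = "a" if x == gap else "b"
            if open_row != row:
                score -= gap_open  # a new gap run starts here
                open_row = row
            score -= gap_extend
        else:
            open_row = None
            score += reward if x == y else penalty
    return score
```

With the defaults listed above, seven matches and one mismatch score 7 - 2 = 5, while eight matches interrupted by a two-base gap score 8 - (5 + 2 x 2) = -1.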

Percent identity may be measured over the length of an entire defined sequence, for example, 
as defined by a particular SEQ ID number, or may be measured over a shorter length, for example, 
over the length of a fragment taken from a larger, defined sequence, for instance, a fragment of at 
least 20, at least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous 
nucleotides. Such lengths are exemplary only, and it is understood that any fragment length supported 
by the sequences shown herein, in the tables, figures, or Sequence Listing, may be used to describe a 
length over which percentage identity may be measured. 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 
similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes 
in a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid 
sequences that all encode substantially the same protein. 
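
The effect of codon degeneracy can be shown with a small Python sketch using the standard genetic code. The two example sequences seq_a and seq_b are invented for illustration and are not taken from the Sequence Listing; the helper name translate is likewise hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate a DNA coding sequence codon by codon (one-letter codes)."""
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Two invented sequences that differ at most positions but encode the
# same peptide (Leu-Ser-Arg-Ser).
seq_a = "CTTAGCCGTTCA"
seq_b = "TTGTCTAGAAGC"
same_peptide = translate(seq_a) == translate(seq_b)
matching_bases = sum(1 for x, y in zip(seq_a, seq_b) if x == y)
```

Here the two sequences share only 2 of 12 nucleotide positions yet translate to the same four-residue peptide.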

The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refer to 
the percentage of residue matches between at least two polypeptide sequences aligned using a 
standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some 
alignment methods take into account conservative amino acid substitutions. Such conservative 
substitutions, explained in more detail above, generally preserve the hydrophobicity and acidity at the 
site of substitution, thus preserving the structure (and therefore function) of the polypeptide. 

Percent identity between polypeptide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e
sequence alignment program (described and referenced above). For pairwise alignments of
polypeptide sequences using CLUSTAL V, the default parameters are set as follows: Ktuple=1, gap
penalty=3, window=5, and "diagonals saved"=5. The PAM250 matrix is selected as the default 
residue weight table. As with polynucleotide alignments, the percent identity is reported by 
CLUSTAL V as the "percent similarity" between aligned polypeptide sequence pairs. 

Alternatively, the NCBI BLAST software suite may be used. For example, for a pairwise
comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 2.0.9
(May-07-1999) with blastp set at default parameters. Such default parameters may be, for example:
Matrix: BLOSUM62
Open Gap: 11 and Extension Gap: 1 penalties
Gap x drop-off: 50
Expect: 10
Word Size: 3
Filter: on

Percent identity may be measured over the length of an entire defined polypeptide sequence, 
for example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for 
example, over the length of a fragment taken from a larger, defined polypeptide sequence, for 
instance, a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 
150 contiguous residues. Such lengths are exemplary only, and it is understood that any fragment 
length supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be 
used to describe a length over which percentage identity may be measured. 

"Human artificial chromosomes" (HACs) are linear microchromosomes which may contain 
DNA sequences of about 6 kb to 10 Mb in size, and which contain all of the elements required for 
stable mitotic chromosome segregation and maintenance. 

The term "humanized antibody" refers to antibody molecules in which the amino acid 
sequence in the non-antigen binding regions has been altered so that the antibody more closely 
resembles a human antibody, and still retains its original binding ability. 

"Hybridization" refers to the process by which a polynucleotide strand anneals with a 
complementary strand through base pairing under defined hybridization conditions. Specific 
hybridization is an indication that two nucleic acid sequences share a high degree of identity. Specific 
hybridization complexes form under permissive annealing conditions and remain hybridized after the 
"washing" step(s). The washing step(s) is particularly important in determining the stringency of the 
hybridization process, with more stringent conditions allowing less non-specific binding, i.e., binding 
between pairs of nucleic acid strands that are not perfectly matched. Permissive conditions for 
annealing of nucleic acid sequences are routinely determinable by one of ordinary skill in the art and 
may be consistent among hybridization experiments, whereas wash conditions may be varied among 
experiments to achieve the desired stringency, and therefore hybridization specificity. Permissive 
annealing conditions occur, for example, at 68°C in the presence of about 6 x SSC, about 1% (w/v) 
SDS, and about 100 μg/ml denatured salmon sperm DNA.
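
Base pairing between a probe and its target can be illustrated with a short Python sketch. The function names reverse_complement and mismatches are hypothetical, and the sketch assumes DNA strands written 5' to 3' with only the four standard bases; it says nothing about whether a given duplex would survive a particular wash.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the strand that pairs antiparallel with `seq`."""
    return seq.translate(_COMPLEMENT)[::-1]


def mismatches(probe, target_site):
    """Count positions where `probe` fails to pair with `target_site`.

    Both strands are written 5' to 3' and must have equal length.
    """
    if len(probe) != len(target_site):
        raise ValueError("probe and target site must have equal length")
    expected = reverse_complement(target_site)
    return sum(1 for p, e in zip(probe, expected) if p != e)
```

A perfectly matched probe gives zero mismatches; more stringent washing tolerates fewer mismatched positions.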

Generally, stringency of hybridization is expressed, in part, with reference to the temperature 
under which the wash step is carried out. Generally, such wash temperatures are selected to be about 
5°C to 20°C lower than the thermal melting point (Tm) for the specific sequence at a defined ionic
strength and pH. The Tm is the temperature (under defined ionic strength and pH) at which 50% of
the target sequence hybridizes to a perfectly matched probe. An equation for calculating Tm and
conditions for nucleic acid hybridization are well known and can be found in Sambrook et al., 1989,
Molecular Cloning: A Laboratory Manual, 2nd ed., vol. 1-3, Cold Spring Harbor Press, Plainview NY;
specifically see volume 2, chapter 9.

High stringency conditions for hybridization between polynucleotides of the present invention 
include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0.1% SDS, for 1 hour. 
Alternatively, temperatures of about 65°C, 60°C, 55°C, or 42°C may be used. SSC concentration 
may be varied from about 0.1 to 2 x SSC, with SDS being present at about 0.1%. Typically, blocking
reagents are used to block non-specific hybridization. Such blocking reagents include, for instance,
denatured salmon sperm DNA at about 100-200 μg/ml. Organic solvent, such as formamide at a
concentration of about 35-50% v/v, may also be used under particular circumstances, such as for
RNA:DNA hybridizations. Useful variations on these wash conditions will be readily apparent to
those of ordinary skill in the art. Hybridization, particularly under high stringency conditions, may be
suggestive of evolutionary similarity between the nucleotides. Such similarity is strongly indicative 
of a similar role for the nucleotides and their encoded polypeptides. 
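
The relationship between melting temperature and wash temperature described above (a wash about 5°C to 20°C below Tm) can be sketched in Python. The Tm equation is not reproduced in this text, so the sketch assumes one commonly cited salt-adjusted approximation for DNA:DNA hybrids, Tm = 81.5 + 16.6 log10[Na+] + 0.41(%G+C) - 675/N, which may differ from the form given in the cited manual. The function names melting_temp and wash_window are hypothetical.

```python
import math


def melting_temp(seq, na_molar):
    """Approximate Tm in degrees C for a DNA:DNA hybrid.

    Assumption: Tm = 81.5 + 16.6*log10([Na+]) + 0.41*(%G+C) - 675/N,
    a widely quoted approximation; it is not taken from this document.
    """
    n = len(seq)
    if n == 0 or na_molar <= 0:
        raise ValueError("need a non-empty sequence and positive [Na+]")
    upper = seq.upper()
    gc_percent = 100.0 * (upper.count("G") + upper.count("C")) / n
    return 81.5 + 16.6 * math.log10(na_molar) + 0.41 * gc_percent - 675.0 / n


def wash_window(tm):
    """Return (lowest, highest) wash temperature, 20 to 5 degrees below Tm."""
    return (tm - 20.0, tm - 5.0)
```

Lower salt concentrations reduce the estimated Tm, which is why dilute SSC washes at a fixed temperature are more stringent.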

The term "hybridization complex" refers to a complex formed between two nucleic acid 
sequences by virtue of the formation of hydrogen bonds between complementary bases. A 
hybridization complex may be formed in solution (e.g., C0t or R0t analysis) or formed between one
nucleic acid sequence present in solution and another nucleic acid sequence immobilized on a solid 
support (e.g., paper, membranes, filters, chips, pins or glass slides, or any other appropriate substrate 
to which cells or their nucleic acids have been fixed). 

The words "insertion" and "addition" refer to changes in an amino acid or nucleotide 
sequence resulting in the addition of one or more amino acid residues or nucleotides, respectively.

"Immune response" can refer to conditions associated with inflammation, trauma, immune 
disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression 
of various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect 
cellular and systemic defense systems.

The term "microarray" refers to an arrangement of distinct polynucleotides on a substrate.

The terms "element" and "array element" in a microarray context, refer to hybridizable 
polynucleotides arranged on the surface of a substrate. 

The term "modulate" refers to a change in the activity of GTPAP. For example, modulation 
may cause an increase or a decrease in protein activity, binding characteristics, or any other 
biological, functional, or immunological properties of GTPAP.

The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, oligonucleotide, 
polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA of genomic or 
synthetic origin which may be single-stranded or double-stranded and may represent the sense or the 
antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-like material.

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a
functional relationship with the second nucleic acid sequence. For instance, a promoter is operably
linked to a coding sequence if the promoter affects the transcription or expression of the coding
sequence. Generally, operably linked DNA sequences may be in close proximity or contiguous and,
where necessary to join two protein coding regions, in the same reading frame.

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of 
amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. 
PNAs preferentially bind complementary single stranded DNA or RNA and stop transcript elongation,
and may be pegylated to extend their lifespan in the cell.

"Probe" refers to nucleic acid sequences encoding GTPAP, their complements, or fragments 
thereof, which are used to detect identical, allelic or related nucleic acid sequences. Probes are 
isolated oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. 
Typical labels include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. 

"Primers" are short nucleic acids, usually DNA oligonucleotides, which may be annealed to a target
polynucleotide by complementary base-pairing. The primer may then be extended along the target 
DNA strand by a DNA polymerase enzyme. Primer pairs can be used for amplification (and 
identification) of a nucleic acid sequence, e.g., by the polymerase chain reaction (PCR). 

Probes and primers as used in the present invention typically comprise at least 15 contiguous
nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also
be employed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 
or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers 
may be considerably longer than these examples, and it is understood that any length supported by the 
specification, including the tables, figures, and Sequence Listing, may be used. 

Methods for preparing and using probes and primers are described in the references, for
example Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual, 2nd ed., vol. 1-3, Cold
Spring Harbor Press, Plainview NY; Ausubel et al., 1987, Current Protocols in Molecular Biology,
Greene Publ. Assoc. & Wiley-Intersciences, New York NY; Innis et al., 1990, PCR Protocols: A
Guide to Methods and Applications, Academic Press, San Diego CA. PCR primer pairs can be
derived from a known sequence, for example, by using computer programs intended for that purpose
such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge MA).

Oligonucleotides for use as primers are selected using software known in the art for such 
purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to 
100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to
5,000 nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer
selection programs have incorporated additional features for expanded capabilities. For example, the
PrimOU primer selection program (available to the public from the Genome Center at University of
Texas South West Medical Center, Dallas TX) is capable of choosing specific primers from megabase
sequences and is thus useful for designing primers on a genome-wide scope. The Primer3 primer
selection program (available to the public from the Whitehead Institute/MIT Center for Genome
Research, Cambridge MA) allows the user to input a "mispriming library," in which sequences to
avoid as primer binding sites are user-specified. Primer3 is useful, in particular, for the selection of
oligonucleotides for microarrays. (The source code for the latter two primer selection programs may
also be obtained from their respective sources and modified to meet the user's specific needs.) The
PrimeGen program (available to the public from the UK Human Genome Mapping Project Resource
Centre, Cambridge UK) designs primers based on multiple sequence alignments, thereby allowing
selection of primers that hybridize to either the most conserved or least conserved regions of aligned
nucleic acid sequences. Hence, this program is useful for identification of both unique and conserved
oligonucleotides and polynucleotide fragments. The oligonucleotides and polynucleotide fragments
identified by any of the above selection methods are useful in hybridization technologies, for 
example, as PCR or sequencing primers, microarray elements, or specific probes to identify fully or 
partially complementary polynucleotides in a sample of nucleic acids. Methods of oligonucleotide 
selection are not limited to those described above. 
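
The idea of excluding user-specified sequences from primer selection can be illustrated with a simplified Python sketch. This is not the algorithm of any of the programs named above: the function name candidate_primers is hypothetical, only exact same-strand substring matches are rejected, and real primer design additionally weighs melting temperature, self-complementarity, and other criteria.

```python
def candidate_primers(template, avoid, length=15):
    """Return (start, primer) pairs for every window of `template` of the
    given length that is not an exact substring of any sequence in `avoid`."""
    candidates = []
    for start in range(len(template) - length + 1):
        window = template[start:start + length]
        if not any(window in seq for seq in avoid):
            candidates.append((start, window))
    return candidates
```

The default window length of 15 follows the minimum probe and primer length mentioned earlier; longer windows can be requested through the length argument.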

A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence
that is made by an artificial combination of two or more otherwise separated segments of sequence.
This artificial combination is often accomplished by chemical synthesis or, more commonly, by the
artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques
such as those described in Sambrook, supra. The term recombinant includes nucleic acids that have
been altered solely by addition, substitution, or deletion of a portion of the nucleic acid. Frequently, a
recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter 
sequence. Such a recombinant nucleic acid may be part of a vector that is used, for example, to 
transform a cell. 

Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 
vaccinia virus, that could be used to vaccinate a mammal wherein the recombinant nucleic acid is
expressed, inducing a protective immunological response in the mammal. 

The term "sample" is used in its broadest sense. A sample suspected of containing nucleic
acids encoding GTPAP, or fragments thereof, or GTPAP itself, may comprise a bodily fluid; an
extract from a cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA,
RNA, or cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc.

The terms "specific binding" and "specifically binding" refer to that interaction between a 
protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or 
synthetic binding composition. The interaction is dependent upon the presence of a particular
structure of the protein, e.g., the antigenic determinant or epitope, recognized by the binding
molecule. For example, if an antibody is specific for epitope "A," the presence of a polypeptide 
containing the epitope A, or the presence of free unlabeled A, in a reaction containing free labeled A 
and the antibody will reduce the amount of labeled A that binds to the antibody. 

The term "substantially purified" refers to nucleic acid or amino acid sequences that are
removed from their natural environment and are isolated or separated, and are at least about 60% free,
preferably about 75% free, and most preferably about 90% free from other components with which 
they are naturally associated. 

A "substitution" refers to the replacement of one or more amino acids or nucleotides by 
different amino acids or nucleotides, respectively. 

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters,
chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers,
microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 

"Transformation" describes a process by which exogenous DNA enters and changes a 

recipient cell. Transformation may occur under natural or artificial conditions according to various
methods well known in the art, and may rely on any known method for the insertion of foreign nucleic
acid sequences into a prokaryotic or eukaryotic host cell. The method for transformation is selected
based on the type of host cell being transformed and may include, but is not limited to, viral infection,
electroporation, heat shock, lipofection, and particle bombardment. The term "transformed" cells
includes stably transformed cells in which the inserted DNA is capable of replication either as an
autonomously replicating plasmid or as part of the host chromosome, as well as transiently 
transformed cells which express the inserted DNA or RNA for limited periods of time. 

A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having 
at least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of
the nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9
(May-07-1999) set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at
least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 95% or at least 98% or
greater sequence identity over a certain defined length. A variant may be described as, for example,
an "allelic" (as defined above), "splice," "species," or "polymorphic" variant. A splice variant may
have significant identity to a reference molecule, but will generally have a greater or lesser number of
nucleotides due to alternate splicing of exons during mRNA processing. The corresponding
polypeptide may possess additional functional domains or lack domains that are present in the
reference molecule. Species variants are polynucleotide sequences that vary from one species to
another. The resulting polypeptides generally will have significant amino acid identity relative to
each other. A polymorphic variant is a variation in the polynucleotide sequence of a particular gene
between individuals of a given species. Polymorphic variants also may encompass "single nucleotide
polymorphisms" (SNPs) in which the polynucleotide sequence varies by one nucleotide base. The
presence of SNPs may be indicative of, for example, a certain population, a disease state, or a
propensity for a disease state.
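
Single-base differences between two versions of a sequence can be located with a few lines of Python. The sketch assumes the two sequences are already aligned without gaps and have equal length; the function name snp_positions is hypothetical, and positions are reported zero-based.

```python
def snp_positions(reference, variant):
    """Return zero-based positions at which two equal-length, ungapped
    sequences carry different bases."""
    if len(reference) != len(variant):
        raise ValueError("sequences must have equal length")
    return [
        i for i, (r, v) in enumerate(zip(reference, variant)) if r != v
    ]
```

A single reported position corresponds to one single nucleotide polymorphism between the two sequences; an empty list means the sequences are identical over the compared length.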

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having 
at least 40% sequence identity to the particular polypeptide sequence over a certain length of one of 
the polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07- 
1999) set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at
least 60%, at least 70%, at least 80%, at least 90%, at least 95%, or at least 98% or greater sequence
identity over a certain defined length of one of the polypeptides. 

THE INVENTION 

The invention is based on the discovery of new human GTPase associated proteins (GTPAP), 
the polynucleotides encoding GTPAP, and the use of these compositions for the diagnosis, treatment,
or prevention of cell proliferative, autoimmune/inflammatory, and immune system disorders. 

Table 1 lists the Incyte clones used to assemble full length nucleotide sequences encoding
GTPAP. Columns 1 and 2 show the sequence identification numbers (SEQ ID NOs) of the
polypeptide and nucleotide sequences, respectively. Column 3 shows the clone IDs of the Incyte
clones in which nucleic acids encoding each GTPAP were identified, and column 4 shows the cDNA
libraries from which these clones were isolated. Column 5 shows Incyte clones and their
corresponding cDNA libraries. Clones for which cDNA libraries are not indicated were derived from
pooled cDNA libraries. The Incyte clones in column 5 were used to assemble the consensus
nucleotide sequence of each GTPAP and are useful as fragments in hybridization technologies.

The columns of Table 2 show various properties of each of the polypeptides of the invention:
column 1 references the SEQ ID NO; column 2 shows the number of amino acid residues in each
polypeptide; column 3 shows potential phosphorylation sites; column 4 shows potential glycosylation
sites; column 5 shows the amino acid residues comprising signature sequences and motifs; column 6
shows homologous sequences as identified by BLAST analysis; and column 7 shows analytical
methods and in some cases, searchable databases to which the analytical methods were applied. The
methods of column 7 were used to characterize each polypeptide through sequence homology and 
protein motifs. 

The columns of Table 3 show the tissue-specificity and diseases, disorders, or conditions
associated with nucleotide sequences encoding GTPAP. The first column of Table 3 lists the
nucleotide SEQ ID NOs. Column 2 lists fragments of the nucleotide sequences of column 1. These
fragments are useful, for example, in hybridization or amplification technologies to identify SEQ ID
NO:30-58 and to distinguish between SEQ ID NO:30-58 and related polynucleotide sequences. The
polypeptides encoded by these fragments are useful, for example, as immunogenic peptides. Column
3 lists tissue categories which express GTPAP as a fraction of total tissues expressing GTPAP.
Column 4 lists diseases, disorders, or conditions associated with those tissues expressing GTPAP as a
fraction of total tissues expressing GTPAP. Column 5 lists the vectors used to subclone each cDNA
library. Of particular note is the specific expression of SEQ ID NO:43 in only one library, a human
testis tissue library; the specific expression of SEQ ID NO:49 in only 4 libraries, one of which is
associated with cell proliferation and 3 of which are associated with inflammation; and the specific
expression of SEQ ID NO:40 in only 5 libraries, 3 of which are associated with cell proliferation and 
one of which is associated with inflammation. 

The columns of Table 4 show descriptions of the tissues used to construct the cDNA libraries 
from which cDNA clones encoding GTPAP were isolated. Column 1 references the nucleotide SEQ
ID NOs, column 2 shows the cDNA libraries from which these clones were isolated, and column 3
shows the tissue origins and other descriptive information relevant to the cDNA libraries in column 2. 

The invention also encompasses GTPAP variants. A preferred GTPAP variant is one which 
has at least about 80%, or alternatively at least about 90%, or even at least about 95% amino acid 
sequence identity to the GTPAP amino acid sequence, and which contains at least one functional or
structural characteristic of GTPAP.

The invention also encompasses polynucleotides which encode GTPAP. In a particular 
embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected 
from the group consisting of SEQ ID NO:30-58, which encodes GTPAP.

The invention also encompasses a variant of a polynucleotide sequence encoding GTPAP. In
particular, such a variant polynucleotide sequence will have at least about 70%, or alternatively at
least about 90%, or even at least about 95% polynucleotide sequence identity to the polynucleotide
sequence encoding GTPAP. A particular aspect of the invention encompasses a variant of a
polynucleotide sequence comprising a sequence selected from the group consisting of SEQ ID
NO:30-58 which has at least about 70%, or alternatively at least about 90%, or even at least about
95% polynucleotide sequence identity to a nucleic acid sequence selected from the group consisting
of SEQ ID NO:30-58. Any one of the polynucleotide variants described above can encode an amino 
acid sequence which contains at least one functional or structural characteristic of GTPAP. 
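
As an illustration of the identity thresholds recited above, the following Python sketch computes percent identity over two sequences that have already been aligned to equal length. It is a minimal sketch under stated assumptions and is not the method of the invention: the function names `percent_identity` and `meets_threshold`, the use of `-` as the gap character, and the convention of counting a gap opposite a residue as a mismatch are all hypothetical choices made for illustration. Where this specification defines percent identity, that definition controls.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over the columns of a pre-computed pairwise alignment.

    Assumption: columns where both sequences carry a gap are ignored, and a
    gap opposite a residue counts as a mismatch. Other conventions exist.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == gap and y == gap:
            continue  # column carries no information, skip it
        compared += 1
        if x == y:
            matches += 1  # identical residues (x cannot be a gap here)
    if compared == 0:
        raise ValueError("no comparable columns in alignment")
    return 100.0 * matches / compared


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float) -> bool:
    """True if the aligned pair reaches the given percent identity."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# Arbitrary placeholder sequences, not taken from any SEQ ID NO.
print(percent_identity("ACGT", "ACGA"))       # 3 of 4 columns identical
print(meets_threshold("ACGT", "ACGA", 70.0))  # compared against the 70% level
```

A variant would be tested against each recited level (about 70%, 90%, or 95%) by passing the corresponding value as `threshold`.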

It will be appreciated by those skilled in the art that as a result of the degeneracy of the
genetic code, a multitude of polynucleotide sequences encoding GTPAP, some bearing minimal
similarity to the polynucleotide sequences of any known and naturally occurring gene, may be
produced. Thus, the invention contemplates each and every possible variation of polynucleotide
sequence that could be made by selecting combinations based on possible codon choices. These
combinations are made in accordance with the standard triplet genetic code as applied to the
polynucleotide sequence of naturally occurring GTPAP, and all such variations are to be considered
as being specifically disclosed.
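
The size of this "multitude" can be made concrete. The Python sketch below counts how many distinct coding sequences the standard genetic code permits for a given peptide, by multiplying the number of synonymous codons available at each residue. The table holds the standard codon counts per amino acid (61 sense codons in total); the function name `count_coding_sequences` and the example peptides are hypothetical and are not drawn from any GTPAP sequence. Stop codons are not counted.

```python
from math import prod

# Number of synonymous codons per amino acid in the standard genetic code.
SYNONYMOUS_CODONS = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2,
    "Q": 2, "E": 2, "G": 4, "H": 2, "I": 3,
    "L": 6, "K": 2, "M": 1, "F": 2, "P": 4,
    "S": 6, "T": 4, "W": 1, "Y": 2, "V": 4,
}


def count_coding_sequences(peptide: str) -> int:
    """Number of distinct nucleotide sequences encoding the peptide."""
    residues = peptide.upper()
    unknown = set(residues) - set(SYNONYMOUS_CODONS)
    if unknown:
        raise ValueError(f"unrecognized residues: {sorted(unknown)}")
    # Codon choices at each position are independent, so the counts multiply.
    return prod(SYNONYMOUS_CODONS[aa] for aa in residues)


# Arbitrary three-residue example: Met (1) x Ser (6) x Arg (6).
print(count_coding_sequences("MSR"))
```

Because the count is a product over residues, it grows exponentially with peptide length, which is why enumeration is contemplated rather than listed.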

Although nucleotide sequences which encode GTPAP and its variants are generally capable
of hybridizing to the nucleotide sequence of the naturally occurring GTPAP under appropriately
selected conditions of stringency, it may be advantageous to produce nucleotide sequences encoding
GTPAP or its derivatives possessing a substantially different codon usage, e.g., inclusion of
non-naturally occurring codons. Codons may be selected to increase the rate at which expression of the
peptide occurs in a particular prokaryotic or eukaryotic host in accordance with the frequency with
which particular codons are utilized by the host. Other reasons for substantially altering the
nucleotide sequence encoding GTPAP and its derivatives without altering the encoded amino acid
sequences include the production of RNA transcripts having more desirable properties, such as a
greater half-life, than transcripts produced from the naturally occurring sequence.

The invention also encompasses production of DNA sequences which encode GTPAP and
GTPAP derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the
synthetic sequence may be inserted into any of the many available expression vectors and cell systems
using reagents well known in the art. Moreover, synthetic chemistry may be used to introduce
mutations into a sequence encoding GTPAP or any fragment thereof.

Also encompassed by the invention are polynucleotide sequences that are capable of
hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID
NO:30-58 and fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M. and
S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol.
152:507-511.) Hybridization conditions, including annealing and wash conditions, are described in
"Definitions."

Methods for DNA sequencing are well known in the art and may be used to practice any of
the embodiments of the invention. The methods may employ such enzymes as the Klenow fragment
of DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (Perkin-Elmer),
thermostable T7 polymerase (Amersham Pharmacia Biotech, Piscataway NJ), or
combinations of polymerases and proofreading exonucleases such as those found in the ELONGASE
amplification system (Life Technologies, Gaithersburg MD). Preferably, sequence preparation is
automated with machines such as the MICROLAB 2200 liquid transfer system (Hamilton, Reno NV),
PTC200 thermal cycler (MJ Research, Watertown MA) and ABI CATALYST 800 thermal cycler
(Perkin-Elmer). Sequencing is then carried out using either the ABI 373 or 377 DNA sequencing
system (Perkin-Elmer), the MEGABACE 1000 DNA sequencing system (Molecular Dynamics,
Sunnyvale CA), or other systems known in the art. The resulting sequences are analyzed using a
variety of algorithms which are well known in the art. (See, e.g., Ausubel, F.M. (1997) Short
Protocols in Molecular Biology, John Wiley & Sons, New York NY, unit 7.7; Meyers, R.A. (1995)
Molecular Biology and Biotechnology, Wiley VCH, New York NY, pp. 856-853.)

The nucleic acid sequences encoding GTPAP may be extended utilizing a partial nucleotide
sequence and employing various PCR-based methods known in the art to detect upstream sequences,
such as promoters and regulatory elements. For example, one method which may be employed,
restriction-site PCR, uses universal and nested primers to amplify unknown sequence from genomic
DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods Applic. 2:318-322.)
Another method, inverse PCR, uses primers that extend in divergent directions to amplify unknown
sequence from a circularized template. The template is derived from restriction fragments comprising
a known genomic locus and surrounding sequences. (See, e.g., Triglia, T. et al. (1988) Nucleic Acids
Res. 16:8186.) A third method, capture PCR, involves PCR amplification of DNA fragments adjacent
to known sequences in human and yeast artificial chromosome DNA. (See, e.g., Lagerstrom, M. et al.
(1991) PCR Methods Applic. 1:111-119.) In this method, multiple restriction enzyme digestions and
ligations may be used to insert an engineered double-stranded sequence into a region of unknown
sequence before performing PCR. Other methods which may be used to retrieve unknown sequences
are known in the art. (See, e.g., Parker, J.D. et al. (1991) Nucleic Acids Res. 19:3055-3060.)
Additionally, one may use PCR, nested primers, and PROMOTERFINDER libraries (Clontech, Palo
Alto CA) to walk genomic DNA. This procedure avoids the need to screen libraries and is useful in
finding intron/exon junctions. For all PCR-based methods, primers may be designed using
commercially available software, such as OLIGO 4.06 Primer Analysis software (National
Biosciences, Plymouth MN) or another appropriate program, to be about 22 to 30 nucleotides in
length, to have a GC content of about 50% or more, and to anneal to the template at temperatures of
about 68°C to 72°C.
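
The length and GC-content criteria just stated can be checked mechanically. The Python sketch below is a minimal illustration only; it does not reproduce OLIGO 4.06 or any other commercial program. The function names `gc_fraction` and `primer_meets_guidelines` and their default parameters are hypothetical, chosen to mirror the "about 22 to 30 nucleotides" and "about 50% or more" figures in the text. Annealing temperature is deliberately not estimated, because it depends on a melting-temperature model and reaction conditions that are not specified here.

```python
def gc_fraction(primer: str) -> float:
    """Fraction of G and C bases in a primer (0.0 to 1.0)."""
    seq = primer.upper()
    if not seq or set(seq) - set("ACGT"):
        raise ValueError("primer must be a non-empty string of A, C, G, T")
    return (seq.count("G") + seq.count("C")) / len(seq)


def primer_meets_guidelines(
    primer: str,
    min_len: int = 22,     # lower bound on length, from the text
    max_len: int = 30,     # upper bound on length, from the text
    min_gc: float = 0.50,  # "GC content of about 50% or more"
) -> bool:
    """Check only the length and GC-content criteria; Tm is not modeled."""
    return min_len <= len(primer) <= max_len and gc_fraction(primer) >= min_gc


# Arbitrary 24-nucleotide placeholder, not a primer from this disclosure.
candidate = "ATGGCCGCTAGCGGATCCGCTAGC"
print(len(candidate), gc_fraction(candidate), primer_meets_guidelines(candidate))
```

Since the text says "about," the bounds are passed as parameters so that they can be relaxed rather than treated as hard limits.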

When screening for full-length cDNAs, it is preferable to use libraries that have been
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include
sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T)
library does not yield a full-length cDNA. Genomic libraries may be useful for extension of sequence
into 5' non-transcribed regulatory regions.

Capillary electrophoresis systems which are commercially available may be used to analyze
the size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary
sequencing may employ flowable polymers for electrophoretic separation, four different
nucleotide-specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the
emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Perkin-Elmer), and the entire process
from loading of samples to computer analysis and electronic data display may be computer controlled.
Capillary electrophoresis is especially preferable for sequencing small DNA fragments which may be
present in limited amounts in a particular sample.

In another embodiment of the invention, polynucleotide sequences or fragments thereof
which encode GTPAP may be cloned in recombinant DNA molecules that direct expression of
GTPAP, or fragments or functional equivalents thereof, in appropriate host cells. Due to the inherent
degeneracy of the genetic code, other DNA sequences which encode substantially the same or a
functionally equivalent amino acid sequence may be produced and used to express GTPAP.

The nucleotide sequences of the present invention can be engineered using methods generally
known in the art in order to alter GTPAP-encoding sequences for a variety of purposes including, but
not limited to, modification of the cloning, processing, and/or expression of the gene product. DNA
shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic
oligonucleotides may be used to engineer the nucleotide sequences. For example,
oligonucleotide-mediated site-directed mutagenesis may be used to introduce mutations that create new restriction
sites, alter glycosylation patterns, change codon preference, produce splice variants, and so forth.

In another embodiment, sequences encoding GTPAP may be synthesized, in whole or in part,
using chemical methods well known in the art. (See, e.g., Caruthers, M.H. et al. (1980) Nucleic Acids
Symp. Ser. 7:215-223; and Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 7:225-232.)
Alternatively, GTPAP itself or a fragment thereof may be synthesized using chemical methods. For
example, peptide synthesis can be performed using various solid-phase techniques. (See, e.g.,
Roberge, J.Y. et al. (1995) Science 269:202-204.) Automated synthesis may be achieved using the
ABI 431A peptide synthesizer (Perkin-Elmer). Additionally, the amino acid sequence of GTPAP, or
any part thereof, may be altered during direct synthesis and/or combined with sequences from other
proteins, or any part thereof, to produce a variant polypeptide.
The peptide may be substantially purified by preparative high performance liquid
chromatography. (See, e.g., Chiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392-421.)
The composition of the synthetic peptides may be confirmed by amino acid analysis or by sequencing.
(See, e.g., Creighton, T. (1984) Proteins, Structures and Molecular Properties, WH Freeman, New
York NY.)

In order to express a biologically active GTPAP, the nucleotide sequences encoding GTPAP
or derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which
contains the necessary elements for transcriptional and translational control of the inserted coding
sequence in a suitable host. These elements include regulatory sequences, such as enhancers,
constitutive and inducible promoters, and 5' and 3' untranslated regions in the vector and in
polynucleotide sequences encoding GTPAP. Such elements may vary in their strength and
specificity. Specific initiation signals may also be used to achieve more efficient translation of
sequences encoding GTPAP. Such signals include the ATG initiation codon and adjacent sequences,
e.g., the Kozak sequence. In cases where sequences encoding GTPAP and its initiation codon and
upstream regulatory sequences are inserted into the appropriate expression vector, no additional
transcriptional or translational control signals may be needed. However, in cases where only coding
sequence, or a fragment thereof, is inserted, exogenous translational control signals including an
in-frame ATG initiation codon should be provided by the vector. Exogenous translational elements and
initiation codons may be of various origins, both natural and synthetic. The efficiency of expression
may be enhanced by the inclusion of enhancers appropriate for the particular host cell system used.
(See, e.g., Scharf, D. et al. (1994) Results Probl. Cell Differ. 20:125-162.)

Methods which are well known to those skilled in the art may be used to construct expression
vectors containing sequences encoding GTPAP and appropriate transcriptional and translational
control elements. These methods include in vitro recombinant DNA techniques, synthetic techniques,
and in vivo genetic recombination. (See, e.g., Sambrook, J. et al. (1989) Molecular Cloning, A
Laboratory Manual, Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 16-17; Ausubel, F.M. et
al. (1995) Current Protocols in Molecular Biology, John Wiley & Sons, New York NY, ch. 9, 13, and
16.)

A variety of expression vector/host systems may be utilized to contain and express sequences
encoding GTPAP. These include, but are not limited to, microorganisms such as bacteria transformed
with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with
yeast expression vectors; insect cell systems infected with viral expression vectors (e.g., baculovirus);
plant cell systems transformed with viral expression vectors (e.g., cauliflower mosaic virus, CaMV, or
tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or
animal cell systems. The invention is not limited by the host cell employed.

In bacterial systems, a number of cloning and expression vectors may be selected depending
upon the use intended for polynucleotide sequences encoding GTPAP. For example, routine cloning,
subcloning, and propagation of polynucleotide sequences encoding GTPAP can be achieved using a
multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla CA) or PSPORT1
plasmid (Life Technologies). Ligation of sequences encoding GTPAP into the vector's multiple
cloning site disrupts the lacZ gene, allowing a colorimetric screening procedure for identification of
transformed bacteria containing recombinant molecules. In addition, these vectors may be useful for
in vitro transcription, dideoxy sequencing, single strand rescue with helper phage, and creation of
nested deletions in the cloned sequence. (See, e.g., Van Heeke, G. and S.M. Schuster (1989) J. Biol.
Chem. 264:5503-5509.) When large quantities of GTPAP are needed, e.g., for the production of
antibodies, vectors which direct high level expression of GTPAP may be used. For example, vectors
containing the strong, inducible T5 or T7 bacteriophage promoter may be used.

Yeast expression systems may be used for production of GTPAP. A number of vectors
containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH
promoters, may be used in the yeast Saccharomyces cerevisiae or Pichia pastoris. In addition, such
vectors direct either the secretion or intracellular retention of expressed proteins and enable
integration of foreign sequences into the host genome for stable propagation. (See, e.g., Ausubel,
1995, supra; Bitter, G.A. et al. (1987) Methods Enzymol. 153:516-544; and Scorer, C.A. et al. (1994)
Bio/Technology 12:181-184.)

Plant systems may also be used for expression of GTPAP. Transcription of sequences
encoding GTPAP may be driven by viral promoters, e.g., the 35S and 19S promoters of CaMV used
alone or in combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J.
6:307-311). Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock
promoters may be used. (See, e.g., Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et al.
(1984) Science 224:838-843; and Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105.)
These constructs can be introduced into plant cells by direct DNA transformation or
pathogen-mediated transfection. (See, e.g., The McGraw Hill Yearbook of Science and Technology
(1992) McGraw Hill, New York NY, pp. 191-196.)

In mammalian cells, a number of viral-based expression systems may be utilized. In cases
where an adenovirus is used as an expression vector, sequences encoding GTPAP may be ligated into
an adenovirus transcription/translation complex consisting of the late promoter and tripartite leader
sequence. Insertion in a non-essential E1 or E3 region of the viral genome may be used to obtain
infective virus which expresses GTPAP in host cells. (See, e.g., Logan, J. and T. Shenk (1984) Proc.
Natl. Acad. Sci. USA 81:3655-3659.) In addition, transcription enhancers, such as the Rous sarcoma
virus (RSV) enhancer, may be used to increase expression in mammalian host cells. SV40 or
EBV-based vectors may also be used for high-level protein expression.

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of 
DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are 
constructed and delivered via conventional delivery methods (liposomes, polycationic amino 
polymers, or vesicles) for therapeutic purposes. (See, e.g., Harrington, J.J. et al. (1997) Nat. Genet.
15:345-355.)

For long term production of recombinant proteins in mammalian systems, stable expression of
GTPAP in cell lines is preferred. For example, sequences encoding GTPAP can be transformed into
cell lines using expression vectors which may contain viral origins of replication and/or endogenous
expression elements and a selectable marker gene on the same or on a separate vector. Following the
introduction of the vector, cells may be allowed to grow for about 1 to 2 days in enriched media
before being switched to selective media. The purpose of the selectable marker is to confer resistance
to a selective agent, and its presence allows growth and recovery of cells which successfully express
the introduced sequences. Resistant clones of stably transformed cells may be propagated using tissue
culture techniques appropriate to the cell type.

Any number of selection systems may be used to recover transformed cell lines. These
include, but are not limited to, the herpes simplex virus thymidine kinase and adenine
phosphoribosyltransferase genes, for use in tk⁻ and apr⁻ cells, respectively. (See, e.g., Wigler, M. et
al. (1977) Cell 11:223-232; Lowy, I. et al. (1980) Cell 22:817-823.) Also, antimetabolite, antibiotic,
or herbicide resistance can be used as the basis for selection. For example, dhfr confers resistance to
methotrexate; neo confers resistance to the aminoglycosides neomycin and G-418; and als and pat
confer resistance to chlorsulfuron and phosphinotricin acetyltransferase, respectively. (See, e.g.,
Wigler, M. et al. (1980) Proc. Natl. Acad. Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981)
J. Mol. Biol. 150:1-14.) Additional selectable genes have been described, e.g., trpB and hisD, which
alter cellular requirements for metabolites. (See, e.g., Hartman, S.C. and R.C. Mulligan (1988) Proc.
Natl. Acad. Sci. USA 85:8047-8051.) Visible markers, e.g., anthocyanins, green fluorescent proteins
(GFP; Clontech), β-glucuronidase and its substrate β-glucuronide, or luciferase and its substrate
luciferin may be used. These markers can be used not only to identify transformants, but also to
quantify the amount of transient or stable protein expression attributable to a specific vector system.
(See, e.g., Rhodes, C.A. (1995) Methods Mol. Biol. 55:121-131.)

Although the presence/absence of marker gene expression suggests that the gene of interest is
also present, the presence and expression of the gene may need to be confirmed. For example, if the
sequence encoding GTPAP is inserted within a marker gene sequence, transformed cells containing
sequences encoding GTPAP can be identified by the absence of marker gene function. Alternatively,
a marker gene can be placed in tandem with a sequence encoding GTPAP under the control of a
single promoter. Expression of the marker gene in response to induction or selection usually indicates
expression of the tandem gene as well.

In general, host cells that contain the nucleic acid sequence encoding GTPAP and that express
GTPAP may be identified by a variety of procedures known to those of skill in the art. These
procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR
amplification, and protein bioassay or immunoassay techniques which include membrane, solution, or
chip based technologies for the detection and/or quantification of nucleic acid or protein sequences.

Immunological methods for detecting and measuring the expression of GTPAP using either
specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques
include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and
fluorescence activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing
monoclonal antibodies reactive to two non-interfering epitopes on GTPAP is preferred, but a
competitive binding assay may be employed. These and other assays are well known in the art. (See,
e.g., Hampton, R. et al. (1990) Serological Methods, a Laboratory Manual, APS Press, St. Paul MN,
Sect. IV; Coligan, J.E. et al. (1997) Current Protocols in Immunology, Greene Pub. Associates and
Wiley-Interscience, New York NY; and Pound, J.D. (1998) Immunochemical Protocols, Humana
Press, Totowa NJ.)

A wide variety of labels and conjugation techniques are known by those skilled in the art and
may be used in various nucleic acid and amino acid assays. Means for producing labeled
hybridization or PCR probes for detecting sequences related to polynucleotides encoding GTPAP
include oligolabeling, nick translation, end-labeling, or PCR amplification using a labeled nucleotide.
Alternatively, the sequences encoding GTPAP, or any fragments thereof, may be cloned into a vector
for the production of an mRNA probe. Such vectors are known in the art, are commercially available,
and may be used to synthesize RNA probes in vitro by addition of an appropriate RNA polymerase
such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using a variety
of commercially available kits, such as those provided by Amersham Pharmacia Biotech, Promega
(Madison WI), and US Biochemical. Suitable reporter molecules or labels which may be used for
ease of detection include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic
agents, as well as substrates, cofactors, inhibitors, magnetic particles, and the like.

Host cells transformed with nucleotide sequences encoding GTPAP may be cultured under
conditions suitable for the expression and recovery of the protein from cell culture. The protein
produced by a transformed cell may be secreted or retained intracellularly depending on the sequence
and/or the vector used. As will be understood by those of skill in the art, expression vectors
containing polynucleotides which encode GTPAP may be designed to contain signal sequences which
direct secretion of GTPAP through a prokaryotic or eukaryotic cell membrane.

In addition, a host cell strain may be chosen for its ability to modulate expression of the
inserted sequences or to process the expressed protein in the desired fashion. Such modifications of
the polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation,
phosphorylation, lipidation, and acylation. Post-translational processing which cleaves a "prepro" or
"pro" form of the protein may also be used to specify protein targeting, folding, and/or activity.
Different host cells which have specific cellular machinery and characteristic mechanisms for
post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from the
American Type Culture Collection (ATCC, Manassas VA) and may be chosen to ensure the correct
modification and processing of the foreign protein.

In another embodiment of the invention, natural, modified, or recombinant nucleic acid
sequences encoding GTPAP may be ligated to a heterologous sequence resulting in translation of a
fusion protein in any of the aforementioned host systems. For example, a chimeric GTPAP protein
containing a heterologous moiety that can be recognized by a commercially available antibody may
facilitate the screening of peptide libraries for inhibitors of GTPAP activity. Heterologous protein and
peptide moieties may also facilitate purification of fusion proteins using commercially available
affinity matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST),
maltose binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG,
c-myc, and hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their cognate
fusion proteins on immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and
metal-chelate resins, respectively. FLAG, c-myc, and hemagglutinin (HA) enable immunoaffinity
purification of fusion proteins using commercially available monoclonal and polyclonal antibodies
that specifically recognize these epitope tags. A fusion protein may also be engineered to contain a
proteolytic cleavage site located between the GTPAP encoding sequence and the heterologous protein
sequence, so that GTPAP may be cleaved away from the heterologous moiety following purification.
Methods for fusion protein expression and purification are discussed in Ausubel (1995, supra, ch. 10).
A variety of commercially available kits may also be used to facilitate expression and purification of
fusion proteins.

In a further embodiment of the invention, synthesis of radiolabeled GTPAP may be achieved
in vitro using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These
systems couple transcription and translation of protein-coding sequences operably associated with the
T7, T3, or SP6 promoters. Translation takes place in the presence of a radiolabeled amino acid
precursor, for example, ³⁵S-methionine.

Fragments of GTPAP may be produced not only by recombinant means, but also by direct
peptide synthesis using solid-phase techniques. (See, e.g., Creighton, supra, pp. 55-60.) Protein
synthesis may be performed by manual techniques or by automation. Automated synthesis may be
achieved, for example, using the ABI 431A peptide synthesizer (Perkin-Elmer). Various fragments of
GTPAP may be synthesized separately and then combined to produce the full length molecule.
THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists
between regions of GTPAP and GTPase associated proteins. In addition, the expression of GTPAP is
closely associated with proliferating tissues associated with cancer and fetal development, inflamed
tissues, and tissues involved in the immune response. Therefore, GTPAP appears to play a role in
cell proliferative, autoimmune/inflammatory, and immune system disorders. In the treatment of
disorders associated with increased GTPAP expression or activity, it is desirable to decrease the
expression or activity of GTPAP. In the treatment of disorders associated with decreased GTPAP
expression or activity, it is desirable to increase the expression or activity of GTPAP.

Therefore, in one embodiment, GTPAP or a fragment or derivative thereof may be
administered to a subject to treat or prevent a disorder associated with decreased expression or
activity of GTPAP. Examples of such disorders include, but are not limited to, a cell proliferative
disorder, such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed
connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria,
polycythemia vera, psoriasis, primary thrombocythemia, and cancers including adenocarcinoma,
leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of
the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall bladder, ganglia,
gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, prostate,
salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; an autoimmune/inflammatory
disorder, such as acquired immunodeficiency syndrome (AIDS), Addison's disease, adult respiratory
distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, atherosclerosis,
autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune
polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, cholecystitis, contact dermatitis, Crohn's
disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, episodic lymphopenia
with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic gastritis,
glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's thyroiditis,
hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, myocardial or
pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis, psoriasis, Reiter's
syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic anaphylaxis, systemic
lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative colitis, uveitis, Werner
syndrome, complications of cancer, hemodialysis, and extracorporeal circulation, viral, bacterial,
fungal, parasitic, protozoal, and helminthic infections, and trauma; and an immune system disorder,
such as acquired immunodeficiency syndrome (AIDS), X-linked agammaglobulinemia of Bruton,
common variable immunodeficiency (CVI), DiGeorge's syndrome (thymic hypoplasia), thymic
dysplasia, isolated IgA deficiency, severe combined immunodeficiency disease (SCID),
immunodeficiency with thrombocytopenia and eczema (Wiskott-Aldrich syndrome),
Chediak-Higashi syndrome, chronic granulomatous diseases, hereditary angioneurotic edema, and
immunodeficiency associated with Cushing's disease, leukemias such as multiple myeloma, and
lymphomas such as Hodgkin's disease.

In another embodiment, a vector capable of expressing GTPAP or a fragment or derivative
thereof may be administered to a subject to treat or prevent a disorder associated with decreased
expression or activity of GTPAP including, but not limited to, those described above.

In a further embodiment, a pharmaceutical composition comprising a substantially purified
GTPAP in conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat
or prevent a disorder associated with decreased expression or activity of GTPAP including, but not
limited to, those provided above.

In still another embodiment, an agonist which modulates the activity of GTPAP may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of GTPAP including, but not limited to, those listed above. 

In a further embodiment, an antagonist of GTPAP may be administered to a subject to treat or
prevent a disorder associated with increased expression or activity of GTPAP. Examples of such
disorders include, but are not limited to, those cell proliferative, autoimmune/inflammatory, and
immune system disorders described above. In one aspect, an antibody which specifically binds
GTPAP may be used directly as an antagonist or indirectly as a targeting or delivery mechanism for
bringing a pharmaceutical agent to cells or tissues which express GTPAP.

In an additional embodiment, a vector expressing the complement of the polynucleotide 
encoding GTPAP may be administered to a subject to treat or prevent a disorder associated with 
increased expression or activity of GTPAP including, but not limited to, those described above. 
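
For illustration, the complement of a polynucleotide in the antisense sense is usually taken to be its reverse complement, i.e., the opposite strand read 5' to 3'. The Python sketch below computes that strand for a DNA string. It is a minimal sketch: the function name `reverse_complement` and the example sequence are hypothetical, the reading of "complement" as the reverse complement is an assumption, and nothing here addresses vector design or delivery.

```python
# Translation table mapping each DNA base to its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the opposite strand of a DNA sequence, written 5' to 3'."""
    if set(seq.upper()) - set("ACGT"):
        raise ValueError("sequence must contain only A, C, G, T")
    # Complement every base, then reverse so the result also reads 5' to 3'.
    return seq.translate(_COMPLEMENT)[::-1]


# Arbitrary placeholder, not a fragment of any SEQ ID NO.
print(reverse_complement("ATGGCC"))
```

Applying the function twice returns the original sequence, which is a convenient sanity check on any strand conversion.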

In other embodiments, any of the proteins, antagonists, antibodies, agonists, complementary
sequences, or vectors of the invention may be administered in combination with other appropriate
therapeutic agents. Selection of the appropriate agents for use in combination therapy may be made
by one of ordinary skill in the art, according to conventional pharmaceutical principles. The
combination of therapeutic agents may act synergistically to effect the treatment or prevention of the
various disorders described above. Using this approach, one may be able to achieve therapeutic
efficacy with lower dosages of each agent, thus reducing the potential for adverse side effects.

An antagonist of GTPAP may be produced using methods which are generally known in the 
art. In particular, purified GTPAP may be used to produce antibodies or to screen libraries of 
pharmaceutical agents to identify those which specifically bind GTPAP. Antibodies to GTPAP may 
also be generated using methods that are well known in the art. Such antibodies may include, but are 
not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments, and 
fragments produced by a Fab expression library. Neutralizing antibodies (i.e., those which inhibit 
dimer formation) are generally preferred for therapeutic use. 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans, 
and others may be immunized by injection with GTPAP or with any fragment or oligopeptide thereof 
which has immunogenic properties. Depending on the host species, various adjuvants may be used to 
increase immunological response. Such adjuvants include, but are not limited to, Freund's, mineral 
gels such as aluminum hydroxide, and surface active substances such as lysolecithin, pluronic 
polyols, polyanions, peptides, oil emulsions, KLH, and dinitrophenol. Among adjuvants used in 
humans, BCG (bacilli Calmette-Guerin) and Corynebacterium parvum are especially preferable. 

It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to 
GTPAP have an amino acid sequence consisting of at least about 5 amino acids, and generally will 
consist of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or 
fragments are identical to a portion of the amino acid sequence of the natural protein and contain the 
entire amino acid sequence of a small, naturally occurring molecule. Short stretches of GTPAP 
amino acids may be fused with those of another protein, such as KLH, and antibodies to the chimeric 
molecule may be produced. 

Monoclonal antibodies to GTPAP may be prepared using any technique which provides for 
the production of antibody molecules by continuous cell lines in culture. These include, but are not 
limited to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV-hybridoma 
technique. (See, e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J. 
Immunol. Methods 81:31-42; Cote, R.J. et al. (1983) Proc. Natl. Acad. Sci. USA 80:2026-2030; and 
Cole, S.P. et al. (1984) Mol. Cell Biol. 62:109-120.) 

In addition, techniques developed for the production of "chimeric antibodies," such as the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 
antigen specificity and biological activity, can be used. (See, e.g., Morrison, S.L. et al. (1984) Proc. 
Natl. Acad. Sci. USA 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; and Takeda, 
S. et al. (1985) Nature 314:452-454.) Alternatively, techniques described for the production of single 
chain antibodies may be adapted, using methods known in the art, to produce GTPAP-specific single 
chain antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may be 
generated by chain shuffling from random combinatorial immunoglobulin libraries. (See, e.g., 
Burton, D.R. (1991) Proc. Natl. Acad. Sci. USA 88:10134-10137.) 

Antibodies may also be produced by inducing in vivo production in the lymphocyte 
population or by screening immunoglobulin libraries or panels of highly specific binding reagents as 
disclosed in the literature. (See, e.g., Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. USA 
86:3833-3837; Winter, G. et al. (1991) Nature 349:293-299.) 

Antibody fragments which contain specific binding sites for GTPAP may also be generated. 
For example, such fragments include, but are not limited to, F(ab')2 fragments produced by pepsin 
digestion of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of 
the F(ab')2 fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and 
easy identification of monoclonal Fab fragments with the desired specificity. (See, e.g., Huse, W.D. 
et al. (1989) Science 246:1275-1281.) 

Various immunoassays may be used for screening to identify antibodies having the desired 
specificity. Numerous protocols for competitive binding or immunoradiometric assays using either 
polyclonal or monoclonal antibodies with established specificities are well known in the art. Such 
immunoassays typically involve the measurement of complex formation between GTPAP and its 
specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies 
reactive to two non-interfering GTPAP epitopes is generally used, but a competitive binding assay 
may also be employed (Pound, supra). 

Various methods such as Scatchard analysis in conjunction with radioimmunoassay 
techniques may be used to assess the affinity of antibodies for GTPAP. Affinity is expressed as an 
association constant, Ka, which is defined as the molar concentration of GTPAP-antibody complex 
divided by the product of the molar concentrations of free antigen and free antibody under equilibrium 
conditions. The Ka determined for a preparation of polyclonal antibodies, which are heterogeneous in their 
affinities for multiple GTPAP epitopes, represents the average affinity, or avidity, of the antibodies 
for GTPAP. The Ka determined for a preparation of monoclonal antibodies, which are monospecific 
for a particular GTPAP epitope, represents a true measure of affinity. High-affinity antibody 
preparations with Ka ranging from about 10^9 to 10^12 L/mole are preferred for use in immunoassays in 
which the GTPAP-antibody complex must withstand rigorous manipulations. Low-affinity antibody 
preparations with Ka ranging from about 10^6 to 10^7 L/mole are preferred for use in 
immunopurification and similar procedures which ultimately require dissociation of GTPAP, 
preferably in active form, from the antibody (Catty, D. (1988) Antibodies, Volume I: A Practical 
Approach, IRL Press, Washington, DC; Liddell, J.E. and Cryer, A. (1991) A Practical Guide to 
Monoclonal Antibodies, John Wiley & Sons, New York NY). 
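
The definition of the association constant given above can be illustrated with a short calculation. The following is a minimal sketch only: the function names and the example concentrations are hypothetical and are not taken from the specification, and the two range labels simply restate the approximate ranges named in the text.

```python
def association_constant(complex_molar, free_antigen_molar, free_antibody_molar):
    """Return Ka in L/mole: [complex] / ([free antigen] * [free antibody]).

    All concentrations are molar values measured at equilibrium.
    """
    if free_antigen_molar <= 0 or free_antibody_molar <= 0:
        raise ValueError("free antigen and free antibody concentrations must be positive")
    return complex_molar / (free_antigen_molar * free_antibody_molar)


def classify_affinity(ka):
    """Label a Ka value against the approximate ranges named in the text."""
    if 1e9 <= ka <= 1e12:
        return "high affinity (range preferred for immunoassays)"
    if 1e6 <= ka <= 1e7:
        return "low affinity (range preferred for immunopurification)"
    return "outside the ranges named in the text"


# Hypothetical equilibrium concentrations, for illustration only.
ka_example = association_constant(2e-9, 1e-9, 1e-9)  # about 2e9 L/mole
label_example = classify_affinity(ka_example)
```

For a polyclonal preparation the same arithmetic yields only an average value, as the text notes, since the antibodies are heterogeneous in their affinities.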

The titer and avidity of polyclonal antibody preparations may be further evaluated to 
determine the quality and suitability of such preparations for certain downstream applications. For 
example, a polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, 
preferably 5-10 mg specific antibody/ml, is generally employed in procedures requiring precipitation 
of GTPAP-antibody complexes. Procedures for evaluating antibody specificity, titer, and avidity, and 
guidelines for antibody quality and usage in various applications, are generally available. (See, e.g., 
Catty, supra, and Coligan et al., supra.) 

In another embodiment of the invention, the polynucleotides encoding GTPAP, or any 
fragment or complement thereof, may be used for therapeutic purposes. In one aspect, the 
complement of the polynucleotide encoding GTPAP may be used in situations in which it would be 
desirable to block the transcription of the mRNA. In particular, cells may be transformed with 
sequences complementary to polynucleotides encoding GTPAP. Thus, complementary molecules or 
fragments may be used to modulate GTPAP activity, or to achieve regulation of gene function. Such 
technology is now well known in the art, and sense or antisense oligonucleotides or larger fragments 
can be designed from various locations along the coding or control regions of sequences encoding 
GTPAP. 

Expression vectors derived from retroviruses, adenoviruses, or herpes or vaccinia viruses, or 
from various bacterial plasmids, may be used for delivery of nucleotide sequences to the targeted 
organ, tissue, or cell population. Methods which are well known to those skilled in the art can be 
used to construct vectors to express nucleic acid sequences complementary to the polynucleotides 
encoding GTPAP. (See, e.g., Sambrook, supra; Ausubel, 1995, supra.) 

Genes encoding GTPAP can be turned off by transforming a cell or tissue with expression 
vectors which express high levels of a polynucleotide, or fragment thereof, encoding GTPAP. Such 
constructs may be used to introduce untranslatable sense or antisense sequences into a cell. Even in 
the absence of integration into the DNA, such vectors may continue to transcribe RNA molecules 
until they are disabled by endogenous nucleases. Transient expression may last for a month or more 
with a non-replicating vector, and may last even longer if appropriate replication elements are part of 
the vector system. 

As mentioned above, modifications of gene expression can be obtained by designing 
complementary sequences or antisense molecules (DNA, RNA, or PNA) to the control, 5', or 
regulatory regions of the gene encoding GTPAP. Oligonucleotides derived from the transcription 
initiation site, e.g., between about positions -10 and +10 from the start site, may be employed. 
Similarly, inhibition can be achieved using triple helix base-pairing methodology. Triple helix 
pairing is useful because it causes inhibition of the ability of the double helix to open sufficiently for 
the binding of polymerases, transcription factors, or regulatory molecules. Recent therapeutic 
advances using triplex DNA have been described in the literature. (See, e.g., Gee, J.E. et al. (1994) in 
Huber, B.E. and B.I. Carr, Molecular and Immunologic Approaches, Futura Publishing, Mt. Kisco 
NY, pp. 163-177.) A complementary sequence or antisense molecule may also be designed to block 
translation of mRNA by preventing the transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 
RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme 
molecule to complementary target RNA, followed by endonucleolytic cleavage. For example, 
engineered hammerhead motif ribozyme molecules may specifically and efficiently catalyze 
endonucleolytic cleavage of sequences encoding GTPAP. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified by 
scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA, 
GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, 
corresponding to the region of the target gene containing the cleavage site, may be evaluated for 
secondary structural features which may render the oligonucleotide inoperable. The suitability of 
candidate targets may also be evaluated by testing accessibility to hybridization with complementary 
oligonucleotides using ribonuclease protection assays. 
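
The initial scanning step described above can be sketched in a few lines. This is an illustrative sketch under stated assumptions, not the method of the specification: the function names are hypothetical, the choice to center each window roughly on the triplet is an assumption, and the subsequent evaluation of secondary structure and hybridization accessibility is not attempted here.

```python
CLEAVAGE_TRIPLETS = ("GUA", "GUU", "GUC")  # sites named in the text


def find_cleavage_sites(sequence):
    """Return (position, triplet) for every GUA, GUU, or GUC in the target.

    DNA input is accepted and converted to RNA by replacing T with U.
    Positions are 0-based offsets of the first base of the triplet.
    """
    rna = sequence.upper().replace("T", "U")
    return [(i, rna[i:i + 3])
            for i in range(len(rna) - 2)
            if rna[i:i + 3] in CLEAVAGE_TRIPLETS]


def candidate_windows(sequence, length=20):
    """Return (position, triplet, window) with a window of `length` bases
    (15 to 20) around each cleavage site, clipped to the sequence ends."""
    if not 15 <= length <= 20:
        raise ValueError("window length should be between 15 and 20")
    rna = sequence.upper().replace("T", "U")
    windows = []
    for pos, triplet in find_cleavage_sites(rna):
        start = max(0, pos + 1 - length // 2)   # roughly center the triplet
        end = start + length
        if end > len(rna):                      # shift left at the 3' end
            end = len(rna)
            start = max(0, end - length)
        windows.append((pos, triplet, rna[start:end]))
    return windows
```

Each returned window is only a candidate; as the text explains, it would still need to be assessed for secondary structure and tested for accessibility, for example by ribonuclease protection assays.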

Complementary ribonucleic acid molecules and ribozymes of the invention may be prepared 
by any method known in the art for the synthesis of nucleic acid molecules. These include techniques 
for chemically synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis. 
Alternatively, RNA molecules may be generated by in vitro and in vivo transcription of DNA 
sequences encoding GTPAP. Such DNA sequences may be incorporated into a wide variety of 
vectors with suitable RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA 
constructs that synthesize complementary RNA, constitutively or inducibly, can be introduced into 
cell lines, cells, or tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. Possible 
modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3' 
ends of the molecule, or the use of phosphorothioate or 2' O-methyl rather than phosphodiester 
linkages within the backbone of the molecule. This concept is inherent in the production of PNAs 
and can be extended in all of these molecules by the inclusion of nontraditional bases such as inosine, 
queuosine, and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, 
cytidine, guanine, thymine, and uridine which are not as easily recognized by endogenous 
endonucleases. 

Many methods for introducing vectors into cells or tissues are available and equally suitable 
for use in vivo, in vitro, and ex vivo. For ex vivo therapy, vectors may be introduced into stem cells 
taken from the patient and clonally propagated for autologous transplant back into that same patient. 
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved 
using methods which are well known in the art. (See, e.g., Goldman, C.K. et al. (1997) Nat. 
Biotechnol. 15:462-466.) 

Any of the therapeutic methods described above may be applied to any subject in need of 
such therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and 
monkeys. 

An additional embodiment of the invention relates to the administration of a pharmaceutical 
or sterile composition, in conjunction with a pharmaceutically acceptable carrier, for any of the 
therapeutic effects discussed above. Such pharmaceutical compositions may consist of GTPAP, 
antibodies to GTPAP, and mimetics, agonists, antagonists, or inhibitors of GTPAP. The 
compositions may be administered alone or in combination with at least one other agent, such as a 
stabilizing compound, which may be administered in any sterile, biocompatible pharmaceutical 
carrier including, but not limited to, saline, buffered saline, dextrose, and water. The compositions 
may be administered to a patient alone, or in combination with other agents, drugs, or hormones. 

The pharmaceutical compositions utilized in this invention may be administered by any 
number of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, 
intramedullary, intrathecal, intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal, 
enteral, topical, sublingual, or rectal means. 

In addition to the active ingredients, these pharmaceutical compositions may contain suitable 
pharmaceutically-acceptable carriers comprising excipients and auxiliaries which facilitate processing 
of the active compounds into preparations which can be used pharmaceutically. Further details on 
techniques for formulation and administration may be found in the latest edition of Remington's 
Pharmaceutical Sciences (Mack Publishing, Easton PA). 

Pharmaceutical compositions for oral administration can be formulated using 
pharmaceutically acceptable carriers well known in the art in dosages suitable for oral administration. 
Such carriers enable the pharmaceutical compositions to be formulated as tablets, pills, dragees, 
capsules, liquids, gels, syrups, slurries, suspensions, and the like, for ingestion by the patient. 

Pharmaceutical preparations for oral use can be obtained through combining active 
compounds with solid excipient and processing the resultant mixture of granules (optionally, after 
grinding) to obtain tablets or dragee cores. Suitable auxiliaries can be added, if desired. Suitable 
excipients include carbohydrate or protein fillers, such as sugars, including lactose, sucrose, mannitol, 
and sorbitol; starch from corn, wheat, rice, potato, or other plants; cellulose, such as methyl cellulose, 
hydroxypropylmethyl-cellulose, or sodium carboxymethylcellulose; gums, including arabic and 
tragacanth; and proteins, such as gelatin and collagen. If desired, disintegrating or solubilizing agents 
may be added, such as the cross-linked polyvinyl pyrrolidone, agar, and alginic acid or a salt thereof, 
such as sodium alginate. 

Dragee cores may be used in conjunction with suitable coatings, such as concentrated sugar 
solutions, which may also contain gum arabic, talc, polyvinylpyrrolidone, carbopol gel, polyethylene 
glycol, and/or titanium dioxide, lacquer solutions, and suitable organic solvents or solvent mixtures. 
Dyestuffs or pigments may be added to the tablets or dragee coatings for product identification or to 
characterize the quantity of active compound, i.e., dosage. 

Pharmaceutical preparations which can be used orally include push-fit capsules made of 
gelatin, as well as soft, sealed capsules made of gelatin and a coating, such as glycerol or sorbitol. 
Push-fit capsules can contain active ingredients mixed with fillers or binders, such as lactose or 
starches, lubricants, such as talc or magnesium stearate, and, optionally, stabilizers. In soft capsules, 
the active compounds may be dissolved or suspended in suitable liquids, such as fatty oils, liquid, or 
liquid polyethylene glycol with or without stabilizers. 

Pharmaceutical formulations suitable for parenteral administration may be formulated in 
aqueous solutions, preferably in physiologically compatible buffers such as Hanks' solution, Ringer's 
solution, or physiologically buffered saline. Aqueous injection suspensions may contain substances 
which increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or 
dextran. Additionally, suspensions of the active compounds may be prepared as appropriate oily 
injection suspensions. Suitable lipophilic solvents or vehicles include fatty oils, such as sesame oil, 
or synthetic fatty acid esters, such as ethyl oleate, triglycerides, or liposomes. Non-lipid polycationic 
amino polymers may also be used for delivery. Optionally, the suspension may also contain suitable 
stabilizers or agents to increase the solubility of the compounds and allow for the preparation of 
highly concentrated solutions. 

For topical or nasal administration, penetrants appropriate to the particular barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art. 

The pharmaceutical compositions of the present invention may be manufactured in a manner 
that is known in the art, e.g., by means of conventional mixing, dissolving, granulating, 
dragee-making, levigating, emulsifying, encapsulating, entrapping, or lyophilizing processes. 

The pharmaceutical composition may be provided as a salt and can be formed with many 
acids, including but not limited to, hydrochloric, sulfuric, acetic, lactic, tartaric, malic, and succinic 
acids. Salts tend to be more soluble in aqueous or other protonic solvents than are the corresponding 
free base forms. In other cases, the preparation may be a lyophilized powder which may contain any 
or all of the following: 1 mM to 50 mM histidine, 0.1% to 2% sucrose, and 2% to 7% mannitol, at a 
pH range of 4.5 to 5.5, that is combined with buffer prior to use. 

After pharmaceutical compositions have been prepared, they can be placed in an appropriate 
container and labeled for treatment of an indicated condition. For administration of GTPAP, such 
labeling would include amount, frequency, and method of administration. 

Pharmaceutical compositions suitable for use in the invention include compositions wherein 
the active ingredients are contained in an effective amount to achieve the intended purpose. The 
determination of an effective dose is well within the capability of those skilled in the art. 

For any compound, the therapeutically effective dose can be estimated initially either in cell 
culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, or pigs. 
An animal model may also be used to determine the appropriate concentration range and route of 
administration. Such information can then be used to determine useful doses and routes for 
administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example 
GTPAP or fragments thereof, antibodies of GTPAP, and agonists, antagonists or inhibitors of 
GTPAP, which ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be 
determined by standard pharmaceutical procedures in cell cultures or with experimental animals, such 
as by calculating the ED50 (the dose therapeutically effective in 50% of the population) or LD50 (the 
dose lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is the 
therapeutic index, which can be expressed as the LD50/ED50 ratio. Pharmaceutical compositions 
which exhibit large therapeutic indices are preferred. The data obtained from cell culture assays and 
animal studies are used to formulate a range of dosage for human use. The dosage contained in such 
compositions is preferably within a range of circulating concentrations that includes the ED50 with 
little or no toxicity. The dosage varies within this range depending upon the dosage form employed, 
the sensitivity of the patient, and the route of administration. 
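
The therapeutic index defined above is a simple ratio, and a short sketch may make the comparison between compositions concrete. The function names, compound labels, and dose values below are hypothetical and serve only to illustrate the arithmetic; they are not data from the specification.

```python
def therapeutic_index(ld50, ed50):
    """Return the LD50/ED50 ratio; both doses must use the same units."""
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("LD50 and ED50 must be positive")
    return ld50 / ed50


def rank_by_therapeutic_index(compounds):
    """Sort (name, ld50, ed50) tuples from largest to smallest index."""
    return sorted(compounds,
                  key=lambda c: therapeutic_index(c[1], c[2]),
                  reverse=True)


# Hypothetical doses in mg/kg, for illustration only.
candidates = [
    ("composition A", 500.0, 10.0),   # index 50
    ("composition B", 300.0, 30.0),   # index 10
]
preferred = rank_by_therapeutic_index(candidates)[0][0]
```

Under the preference stated in the text for large therapeutic indices, the composition ranked first would be favored, all else being equal.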

The exact dosage will be determined by the practitioner, in light of factors related to the 
subject requiring treatment. Dosage and administration are adjusted to provide sufficient levels of the 
active moiety or to maintain the desired effect. Factors which may be taken into account include the 
severity of the disease state, the general health of the subject, the age, weight, and gender of the 
subject, time and frequency of administration, drug combination(s), reaction sensitivities, and 
response to therapy. Long-acting pharmaceutical compositions may be administered every 3 to 4 
days, every week, or biweekly depending on the half-life and clearance rate of the particular 
formulation. 

Normal dosage amounts may vary from about 0.1 µg to 100,000 µg, up to a total dose of 
about 1 gram, depending upon the route of administration. Guidance as to particular dosages and 
methods of delivery is provided in the literature and generally available to practitioners in the art. 
Those skilled in the art will employ different formulations for nucleotides than for proteins or their 
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 
conditions, locations, etc. 
DIAGNOSTICS 

In another embodiment, antibodies which specifically bind GTPAP may be used for the 
diagnosis of disorders characterized by expression of GTPAP, or in assays to monitor patients being 
treated with GTPAP or agonists, antagonists, or inhibitors of GTPAP. Antibodies useful for 
diagnostic purposes may be prepared in the same manner as described above for therapeutics. 
Diagnostic assays for GTPAP include methods which utilize the antibody and a label to detect 
GTPAP in human body fluids or in extracts of cells or tissues. The antibodies may be used with or 
without modification, and may be labeled by covalent or non-covalent attachment of a reporter 
molecule. A wide variety of reporter molecules, several of which are described above, are known in 
the art and may be used. 

A variety of protocols for measuring GTPAP, including ELISAs, RIAs, and FACS, are 
known in the art and provide a basis for diagnosing altered or abnormal levels of GTPAP expression. 
Normal or standard values for GTPAP expression are established by combining body fluids or cell 
extracts taken from normal mammalian subjects, for example, human subjects, with antibody to 
GTPAP under conditions suitable for complex formation. The amount of standard complex 
formation may be quantitated by various methods, such as photometric means. Quantities of GTPAP 
expressed in subject, control, and disease samples from biopsied tissues are compared with the 
standard values. Deviation between standard and subject values establishes the parameters for 
diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding GTPAP may be used 
for diagnostic purposes. The polynucleotides which may be used include oligonucleotide sequences, 
complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect 
and quantify gene expression in biopsied tissues in which expression of GTPAP may be correlated 
with disease. The diagnostic assay may be used to determine absence, presence, and excess 
expression of GTPAP, and to monitor regulation of GTPAP levels during therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of detecting polynucleotide 
sequences, including genomic sequences, encoding GTPAP or closely related molecules may be used 
to identify nucleic acid sequences which encode GTPAP. The specificity of the probe, whether it is 
made from a highly specific region, e.g., the 5' regulatory region, or from a less specific region, e.g., a 
conserved motif, and the stringency of the hybridization or amplification will determine whether the 
probe identifies only naturally occurring sequences encoding GTPAP, allelic variants, or related 
sequences. 

Probes may also be used for the detection of related sequences, and may have at least 50% 
sequence identity to any of the GTPAP encoding sequences. The hybridization probes of the subject 
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:30-58 or from 
genomic sequences including promoters, enhancers, and introns of the GTPAP gene. 
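
The 50% sequence identity threshold mentioned above can be checked for a pair of sequences once they have been aligned. The sketch below assumes the two sequences are already aligned to equal length with "-" marking gaps, and it counts identity over all aligned columns; other conventions for the denominator exist, and the specification does not state which one is intended. The function names are hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity of two pre-aligned sequences of equal length.

    Columns where both sequences have a gap are ignored; a column with a
    gap in only one sequence counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    columns = [(x, y)
               for x, y in zip(aligned_a.upper(), aligned_b.upper())
               if not (x == "-" and y == "-")]
    if not columns:
        raise ValueError("no aligned positions to compare")
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)


def meets_identity_threshold(aligned_a, aligned_b, threshold=50.0):
    """True when identity is at least `threshold` percent (50% in the text)."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

Producing the alignment itself is outside the scope of this sketch and would normally be done with an established alignment tool.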

Means for producing specific hybridization probes for DNAs encoding GTPAP include the 
cloning of polynucleotide sequences encoding GTPAP or GTPAP derivatives into vectors for the 
production of mRNA probes. Such vectors are known in the art, are commercially available, and may 
be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA 
polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a 
variety of reporter groups, for example, by radionuclides such as 32P or 35S, or by enzymatic labels, 
such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like. 






WO 00/31263 PCT/US99/28013 

Polynucleotide sequences encoding GTPAP may be used for the diagnosis of disorders 
associated with expression of GTPAP. Examples of such disorders include, but are not limited to, a 
cell proliferative disorder, such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, 
cirrhosis, hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal 
hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia, and cancers including 
adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in 
particular, cancers of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall 
bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, 
parathyroid, penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; an 
autoimmune/inflammatory disorder, such as acquired immunodeficiency syndrome (AIDS), 
Addison's disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, 
anemia, asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune 
polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, cholecystitis, contact 
dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, 
episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic 
gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's 
thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, 
myocardial or pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis, 
psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic 
anaphylaxis, systemic lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative 
colitis, uveitis, Werner syndrome, complications of cancer, hemodialysis, and extracorporeal 
circulation, viral, bacterial, fungal, parasitic, protozoal, and helminthic infections, and trauma; and an 
immune system disorder, such as acquired immunodeficiency syndrome (AIDS), X-linked 
agammaglobulinemia of Bruton, common variable immunodeficiency (CVI), DiGeorge's syndrome 
(thymic hypoplasia), thymic dysplasia, isolated IgA deficiency, severe combined immunodeficiency 
disease (SCID), immunodeficiency with thrombocytopenia and eczema (Wiskott-Aldrich syndrome), 
Chediak-Higashi syndrome, chronic granulomatous diseases, hereditary angioneurotic edema, and 
immunodeficiency associated with Cushing's disease, leukemias such as multiple myeloma, and 
lymphomas such as Hodgkin's disease. The polynucleotide sequences encoding GTPAP may be used 
in Southern or northern analysis, dot blot, or other membrane-based technologies; in PCR 
technologies; in dipstick, pin, and multiformat ELISA-like assays; and in microarrays utilizing fluids 
or tissues from patients to detect altered GTPAP expression. Such qualitative or quantitative methods 
are well known in the art. 

In a particular aspect, the nucleotide sequences encoding GTPAP may be useful in assays that 
detect the presence of associated disorders, particularly those mentioned above. The nucleotide 
sequences encoding GTPAP may be labeled by standard methods and added to a fluid or tissue 
sample from a patient under conditions suitable for the formation of hybridization complexes. After a 
suitable incubation period, the sample is washed and the signal is quantified and compared with a 
standard value. If the amount of signal in the patient sample is significantly altered in comparison to 
a control sample, then the presence of altered levels of nucleotide sequences encoding GTPAP in the 
sample indicates the presence of the associated disorder. Such assays may also be used to evaluate 
the efficacy of a particular therapeutic treatment regimen in animal studies, in clinical trials, or to 
monitor the treatment of an individual patient. 

In order to provide a basis for the diagnosis of a disorder associated with expression of 
GTPAP, a normal or standard profile for expression is established. This may be accomplished by 
combining body fluids or cell extracts taken from normal subjects, either animal or human, with a 
sequence, or a fragment thereof, encoding GTPAP, under conditions suitable for hybridization or 
amplification. Standard hybridization may be quantified by comparing the values obtained from 
normal subjects with values from an experiment in which a known amount of a substantially purified 
polynucleotide is used. Standard values obtained in this manner may be compared with values 
obtained from samples from patients who are symptomatic for a disorder. Deviation from standard 
values is used to establish the presence of a disorder. 

Once the presence of a disorder is established and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to determine if the level of expression in the 
patient begins to approximate that which is observed in the normal subject. The results obtained from 
successive assays may be used to show the efficacy of treatment over a period ranging from several 
days to months. 

With respect to cancer, the presence of an abnormal amount of transcript (either under- or 
overexpressed) in biopsied tissue from an individual may indicate a predisposition for the 
development of the disease, or may provide a means for detecting the disease prior to the appearance 
of actual clinical symptoms. A more definitive diagnosis of this type may allow health professionals 
to employ preventative measures or aggressive treatment earlier, thereby preventing the development 
or further progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed from the sequences encoding 
GTPAP may involve the use of PCR. These oligomers may be chemically synthesized, generated 
enzymatically, or produced in vitro. Oligomers will preferably contain a fragment of a 
polynucleotide encoding GTPAP, or a fragment of a polynucleotide complementary to the 
polynucleotide encoding GTPAP, and will be employed under optimized conditions for identification 
of a specific gene or condition. Oligomers may also be employed under less stringent conditions for 
detection or quantification of closely related DNA or RNA sequences. 

Methods which may also be used to quantify the expression of GTPAP include radiolabeling 
or biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from 
standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C. 
et al. (1993) Anal. Biochem. 212:229-236.) Quantitation of multiple samples may be 
accelerated by running the assay in a high-throughput format where the oligomer of interest is 
presented in various dilutions and a spectrophotometric or colorimetric response gives rapid 
quantitation. 
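
By way of illustration only, the interpolation of a sample signal on a standard curve may be 
sketched as follows. The function name, the use of linear interpolation between neighboring 
standards, and the example values are assumptions made for this sketch and are not taken from 
the cited methods. 

```python
from bisect import bisect_left
from typing import List, Tuple


def interpolate_amount(signal: float, curve: List[Tuple[float, float]]) -> float:
    """Estimate an amount from a measured signal using a standard curve.

    curve holds (signal, amount) pairs measured for known standards.
    Linear interpolation between the two neighboring standards is an
    assumption of this sketch; real assays may call for a fitted model.
    """
    if len(curve) < 2:
        raise ValueError("at least two standards are required")
    points = sorted(curve)
    signals = [s for s, _ in points]
    if not signals[0] <= signal <= signals[-1]:
        raise ValueError("signal lies outside the range of the standards")
    i = bisect_left(signals, signal)
    if signals[i] == signal:
        # The signal coincides with a standard; return its amount directly.
        return points[i][1]
    (s0, a0), (s1, a1) = points[i - 1], points[i]
    return a0 + (a1 - a0) * (signal - s0) / (s1 - s0)


# Hypothetical standards: (signal in arbitrary units, amount in ng)
standards = [(10.0, 1.0), (50.0, 5.0), (90.0, 9.0)]
estimate = interpolate_amount(30.0, standards)
```

Signals outside the range of the standards are rejected rather than extrapolated, since the 
text describes interpolation only. 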

In further embodiments, oligonucleotides or longer fragments derived from any of the 
polynucleotide sequences described herein may be used as targets in a microarray. The microarray 
can be used to monitor the expression level of large numbers of genes simultaneously and to identify 
genetic variants, mutations, and polymorphisms. This information may be used to determine gene 
function, to understand the genetic basis of a disorder, to diagnose a disorder, and to develop and 
monitor the activities of therapeutic agents. 

Microarrays may be prepared, used, and analyzed using methods known in the art. (See, e.g., 
Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. Acad. Sci. 
USA 93:10614-10619; Baldeschweiler et al. (1995) PCT application WO95/251116; Shalon, D. et al. 
(1995) PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad. Sci. USA 
94:2150-2155; and Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662.) 

In another embodiment of the invention, nucleic acid sequences encoding GTPAP may be 
used to generate hybridization probes useful in mapping the naturally occurring genomic sequence. 
The sequences may be mapped to a particular chromosome, to a specific region of a chromosome, or 
to artificial chromosome constructions, e.g., human artificial chromosomes (HACs), yeast artificial 
chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial P1 constructions, or single 
chromosome cDNA libraries. (See, e.g., Harrington, J.J. et al. (1997) Nat. Genet. 15:345-355; Price, 
C.M. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trends Genet. 7:149-154.) 

Fluorescent in situ hybridization (FISH) may be correlated with other physical chromosome 
mapping techniques and genetic map data. (See, e.g., Heinz-Ulrich, et al. (1995) in Meyers, supra, 
pp. 965-968.) Examples of genetic map data can be found in various scientific journals or at the 
Online Mendelian Inheritance in Man (OMIM) World Wide Web site. Correlation between the 
location of the gene encoding GTPAP on a physical chromosomal map and a specific disorder, or a 
predisposition to a specific disorder, may help define the region of DNA associated with that 
disorder. The nucleotide sequences of the invention may be used to detect differences in gene 
sequences among normal, carrier, and affected individuals. 

In situ hybridization of chromosomal preparations and physical mapping techniques, such as 
linkage analysis using established chromosomal markers, may be used for extending genetic maps. 
Often the placement of a gene on the chromosome of another mammalian species, such as mouse, 
may reveal associated markers even if the number or arm of a particular human chromosome is not 
known. New sequences can be assigned to chromosomal arms by physical mapping. This provides 
valuable information to investigators searching for disease genes using positional cloning or other 
gene discovery techniques. Once the disease or syndrome has been crudely localized by genetic 
linkage to a particular genomic region, e.g., ataxia-telangiectasia to 11q22-23, any sequences 
mapping to that area may represent associated or regulatory genes for further investigation. (See, 
e.g., Gatti, R.A. et al. (1988) Nature 336:577-580.) The nucleotide sequence of the subject invention 
may also be used to detect differences in the chromosomal location due to translocation, inversion, 
etc., among normal, carrier, or affected individuals. 

In another embodiment of the invention, GTPAP, its catalytic or immunogenic fragments, or 
oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug 
screening techniques. The fragment employed in such screening may be free in solution, affixed to a 
solid support, borne on a cell surface, or located intracellularly. The formation of binding complexes 
between GTPAP and the agent being tested may be measured. 

Another technique for drug screening provides for high throughput screening of compounds 
having suitable binding affinity to the protein of interest. (See, e.g., Geysen, et al. (1984) PCT 
application WO84/03564.) In this method, large numbers of different small test compounds are 
synthesized on a solid substrate. The test compounds are reacted with GTPAP, or fragments thereof, 
and washed. Bound GTPAP is then detected by methods well known in the art. Purified GTPAP can 
also be coated directly onto plates for use in the aforementioned drug screening techniques. 
Alternatively, non-neutralizing antibodies can be used to capture the peptide and immobilize it on a 
solid support. 

In another embodiment, one may use competitive drug screening assays in which neutralizing 
antibodies capable of binding GTPAP specifically compete with a test compound for binding 
GTPAP. In this manner, antibodies can be used to detect the presence of any peptide which shares 
one or more antigenic determinants with GTPAP. 

In additional embodiments, the nucleotide sequences which encode GTPAP may be used in 
any molecular biology techniques that have yet to be developed, provided the new techniques rely on 
properties of nucleotide sequences that are currently known, including, but not limited to, such 
properties as the triplet genetic code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, using the preceding 
description, utilize the present invention to its fullest extent. The following preferred specific 
embodiments are, therefore, to be construed as merely illustrative, and not limitative of the remainder 
of the disclosure in any way whatsoever. 

The disclosures of all patents, applications, and publications mentioned above and below, in 
particular U.S. Ser. Nos. 60/109,592, 60/118,610, and 60/127,990, are hereby expressly incorporated 
by reference. 

EXAMPLES 

I. Construction of cDNA Libraries 

RNA was purchased from Clontech or isolated from tissues described in Table 4. Some 
tissues were homogenized and lysed in guanidinium isothiocyanate, while others were homogenized 
and lysed in phenol or in a suitable mixture of denaturants, such as TRIZOL (Life Technologies), a 
monophasic solution of phenol and guanidine isothiocyanate. The resulting lysates were centrifuged 
over CsCl cushions or extracted with chloroform. RNA was precipitated from the lysates with either 
isopropanol or sodium acetate and ethanol, or by other routine methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 
purity. In some cases, RNA was treated with DNase. For most libraries, poly(A+) RNA was isolated 
using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles (QIAGEN, 
Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, RNA was 
isolated directly from tissue lysates using other RNA isolation kits, e.g., the POLY(A)PURE mRNA 
purification kit (Ambion, Austin TX). 

In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA 
libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the UNIZAP 
vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), using the 
recommended procedures or similar methods known in the art. (See, e.g., Ausubel, 1997, supra, units 
5.1-6.6.) Reverse transcription was initiated using oligo d(T) or random primers. Synthetic 
oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA was digested with the 
appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected 
(300-1000 bp) using SEPHACRYL S1000, SEPHAROSE CL2B, or SEPHAROSE CL4B column 
chromatography (Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. cDNAs 
were ligated into compatible restriction enzyme sites of the polylinker of a suitable plasmid, e.g., 
PBLUESCRIPT plasmid (Stratagene), PSPORT1 plasmid (Life Technologies), or pINCY (Incyte 
Pharmaceuticals, Palo Alto CA). Recombinant plasmids were transformed into competent E. coli 
cells including XL1-Blue, XL1-BlueMRF, or SOLR from Stratagene or DH5α, DH10B, or 
ElectroMAX DH10B from Life Technologies. 

II. Isolation of cDNA Clones 

Plasmids were recovered from host cells by in vivo excision using the UNIZAP vector 
system (Stratagene) or by cell lysis. Plasmids were purified using at least one of the following: a 
Magic or WIZARD Minipreps DNA purification system (Promega); an AGTC Miniprep purification 
kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, QIAWELL 8 Plus Plasmid, 
QIAWELL 8 Ultra Plasmid purification systems or the R.E.A.L. PREP 96 plasmid purification kit 
from QIAGEN. Following precipitation, plasmids were resuspended in 0.1 ml of distilled water and 
stored, with or without lyophilization, at 4°C. 

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a 
high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal 
cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 
384-well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically 
using PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN II fluorescence 
scanner (Labsystems Oy, Helsinki, Finland). 

III. Sequencing and Analysis 

cDNA sequencing reactions were processed using standard methods or high-throughput 
instrumentation such as the ABI CATALYST 800 (Perkin-Elmer) thermal cycler or the PTC-200 
thermal cycler (MJ Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) 
or the MICROLAB 2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were 
prepared using reagents provided by Amersham Pharmacia Biotech or supplied in ABI sequencing 
kits such as the ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit 
(Perkin-Elmer). Electrophoretic separation of cDNA sequencing reactions and detection of labeled 
polynucleotides were carried out using the MEGABACE 1000 DNA sequencing system (Molecular 
Dynamics); the ABI PRISM 373 or 377 sequencing system (Perkin-Elmer) in conjunction with 
standard ABI protocols and base calling software; or other sequence analysis systems known in the 
art. Reading frames within the cDNA sequences were identified using standard methods (reviewed in 
Ausubel, 1997, supra, unit 7.7). Some of the cDNA sequences were selected for extension using the 
techniques disclosed in Example V. 

The polynucleotide sequences derived from cDNA sequencing were assembled and analyzed 
using a combination of software programs which utilize algorithms well known to those skilled in the 
art. Table 5 summarizes the tools, programs, and algorithms used and provides applicable 
descriptions, references, and threshold parameters. The first column of Table 5 shows the tools, 
programs, and algorithms used, the second column provides brief descriptions thereof, the third 
column presents appropriate references, all of which are incorporated by reference herein in their 
entirety, and the fourth column presents, where applicable, the scores, probability values, and other 
parameters used to evaluate the strength of a match between two sequences (the higher the score, the 
greater the homology between two sequences). Sequences were analyzed using MACDNASIS PRO 
software (Hitachi Software Engineering, South San Francisco CA) and LASERGENE software 
(DNASTAR). Polynucleotide and polypeptide sequence alignments were generated using the default 
parameters specified by the clustal algorithm as incorporated into the MEGALIGN multisequence 
alignment program (DNASTAR), which also calculates the percent identity between aligned 
sequences. 

The polynucleotide sequences were validated by removing vector, linker, and polyA 
sequences and by masking ambiguous bases, using algorithms and programs based on BLAST, 
dynamic programming, and dinucleotide nearest neighbor analysis. The sequences were then queried 
against a selection of public databases such as the GenBank primate, rodent, mammalian, vertebrate, 
and eukaryote databases, and BLOCKS, PRINTS, DOMO, PRODOM, and PFAM to acquire 
annotation using programs based on BLAST, FASTA, and BLIMPS. The sequences were assembled 
into full length polynucleotide sequences using programs based on Phred, Phrap, and Consed, and 
were screened for open reading frames using programs based on GeneMark, BLAST, and FASTA. 
The full length polynucleotide sequences were translated to derive the corresponding full length 
amino acid sequences, and these full length sequences were subsequently analyzed by querying 
against databases such as the GenBank databases (described above), SwissProt, BLOCKS, PRINTS, 
DOMO, PRODOM, Prosite, and Hidden Markov Model (HMM)-based protein family databases such 
as PFAM. HMM is a probabilistic approach which analyzes consensus primary structures of gene 
families. (See, e.g., Eddy, S.R. (1996) Curr. Opin. Struct. Biol. 6:361-365.) 

The programs described above for the assembly and analysis of full length polynucleotide 
and amino acid sequences were also used to identify polynucleotide sequence fragments from SEQ 
ID NO:30-58. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization 
and amplification technologies were described in The Invention section above. 

IV. Northern Analysis 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a 
gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs 
from a particular cell type or tissue have been bound. (See, e.g., Sambrook, supra, ch. 7; Ausubel, 
1995, supra, ch. 4 and 16.) 

Analogous computer techniques applying BLAST were used to search for identical or related 
molecules in nucleotide databases such as GenBank or LIFESEQ (Incyte Pharmaceuticals). This 
analysis is much faster than multiple membrane-based hybridizations. In addition, the sensitivity of 
the computer search can be modified to determine whether any particular match is categorized as 
exact or similar. The basis of the search is the product score, which is defined as: 

    product score = (% sequence identity x % maximum BLAST score) / 100 

The product score takes into account both the degree of similarity between two sequences and the 
length of the sequence match. For example, with a product score of 40, the match will be exact 
within a 1% to 2% error, and, with a product score of 70, the match will be exact. Similar molecules 
are usually identified by selecting those which show product scores between 15 and 40, although 
lower scores may identify related molecules. 
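
The product score calculation may be sketched as follows. The function names and the example 
inputs are hypothetical, and the handling of the boundary values (a score of exactly 40 is 
treated here as exact within error rather than as similar) is an assumption of this sketch; 
the thresholds themselves are those stated above. 

```python
def product_score(percent_identity: float, percent_max_blast_score: float) -> float:
    """Product score = (% sequence identity x % maximum BLAST score) / 100."""
    for value in (percent_identity, percent_max_blast_score):
        if not 0.0 <= value <= 100.0:
            raise ValueError("percentages must lie between 0 and 100")
    return percent_identity * percent_max_blast_score / 100.0


def classify_match(score: float) -> str:
    """Label a match using the thresholds quoted in the text.

    The label strings and the boundary handling are illustrative assumptions.
    """
    if score >= 70:
        return "exact"
    if score >= 40:
        return "exact within 1% to 2% error"
    if score >= 15:
        return "similar"
    return "possibly related"


# Hypothetical match: 80% identity over a hit reaching 50% of the maximum BLAST score
score = product_score(80.0, 50.0)
label = classify_match(score)
```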

The results of northern analyses are reported as a percentage distribution of libraries in which 
the transcript encoding GTPAP occurred. Analysis involved the categorization of cDNA libraries by 
organ/tissue and disease. The organ/tissue categories included cardiovascular, dermatologic, 
developmental, endocrine, gastrointestinal, hematopoietic/immune, musculoskeletal, nervous, 
reproductive, and urologic. The disease/condition categories included cancer, inflammation, trauma, 
cell proliferation, neurological, and pooled. For each category, the number of libraries expressing the 
sequence of interest was counted and divided by the total number of libraries across all categories. 
Percentage values of tissue-specific and disease- or condition-specific expression are reported in 
Table 3. 
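
The counting step described above may be sketched as follows. The function name and the 
example library assignments are hypothetical. The denominator is taken here, by assumption, 
as the number of expressing libraries summed across all categories, so that the values form a 
distribution; where the denominator is instead every library surveyed, that count can be 
passed as `total`. 

```python
from collections import Counter
from typing import Dict, Iterable, Optional


def category_distribution(categories_of_expressing_libraries: Iterable[str],
                          total: Optional[int] = None) -> Dict[str, float]:
    """Percentage of libraries per category in which the transcript occurred.

    Each item is the category label of one library expressing the sequence.
    If total is None, the number of expressing libraries is the denominator
    (an assumption of this sketch).
    """
    counts = Counter(categories_of_expressing_libraries)
    denominator = total if total is not None else sum(counts.values())
    if denominator <= 0:
        raise ValueError("denominator must be positive")
    return {category: 100.0 * n / denominator for category, n in counts.items()}


# Hypothetical organ/tissue categories for four expressing libraries
distribution = category_distribution(
    ["nervous", "nervous", "reproductive", "cardiovascular"]
)
```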

V. Extension of GTPAP Encoding Polynucleotides 

The full length nucleic acid sequences of SEQ ID NO:30-58 were produced by extension of 
an appropriate fragment of the full length molecule using oligonucleotide primers designed from this 
fragment. One primer was synthesized to initiate 5' extension of the known fragment, and the other 
primer, to initiate 3' extension of the known fragment. The initial primers were designed using 
OLIGO 4.06 software (National Biosciences), or another appropriate program, to be about 22 to 30 
nucleotides in length, to have a GC content of about 50% or more, and to anneal to the target 
sequence at temperatures of about 68°C to about 72°C. Any stretch of nucleotides which would 
result in hairpin structures and primer-primer dimerizations was avoided. 
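
Two of the design criteria above, primer length and GC content, may be checked with a short 
sketch such as the following. This is not the OLIGO 4.06 algorithm; the function names, the 
hard cut-offs standing in for "about 22 to 30" and "about 50% or more", and the example 
sequences are assumptions. Annealing temperature, hairpins, and primer-primer dimerization 
are not evaluated here. 

```python
def gc_content(primer: str) -> float:
    """Return the GC content of a primer as a percentage."""
    seq = primer.upper()
    if not seq or set(seq) - set("ACGT"):
        raise ValueError("primer must be a non-empty string of A, C, G, and T")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def meets_basic_criteria(primer: str,
                         min_len: int = 22,
                         max_len: int = 30,
                         min_gc: float = 50.0) -> bool:
    """Check length and GC content only (cut-offs are illustrative assumptions)."""
    return min_len <= len(primer) <= max_len and gc_content(primer) >= min_gc


# Hypothetical 22-mers, not taken from SEQ ID NO:30-58
gc_rich = "GC" * 6 + "AT" * 5   # 12 of 22 bases are G or C
at_only = "AT" * 11             # no G or C
```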

Selected human cDNA libraries were used to extend the sequence. If more than one 
extension was necessary or desired, additional or nested sets of primers were designed. 

High fidelity amplification was obtained by PCR using methods well known in the art. PCR 
was performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The reaction 
mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg2+, (NH4)2SO4, 
and β-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE enzyme 
(Life Technologies), and Pfu DNA polymerase (Stratagene), with the following parameters for primer 
pair PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 68°C, 
2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min; Step 7: storage at 4°C. In the 
alternative, the parameters for primer pair T7 and SK+ were as follows: Step 1: 94°C, 3 min; Step 2: 
94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; 
Step 6: 68°C, 5 min; Step 7: storage at 4°C. 
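
The cycling parameters above may be represented as a simple data structure, for example to 
estimate the programmed hold time. The class and function names are hypothetical. The sketch 
assumes that Step 5 adds 20 repeats to the first pass through Steps 2 to 4 (21 passes in 
all), and it ignores ramp times and the final 4°C storage step. 

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Hold:
    temp_c: int    # block temperature in degrees Celsius
    seconds: int   # hold time in seconds


def build_program(anneal_temp_c: int, repeats: int = 20) -> List[Hold]:
    """Expand Steps 1 to 6 into a flat list of holds.

    anneal_temp_c is 60 for primer pair PCI A and PCI B, or 57 for T7 and SK+.
    """
    cycle = [Hold(94, 15), Hold(anneal_temp_c, 60), Hold(68, 120)]  # Steps 2-4
    passes = 1 + repeats  # assumption: first pass plus the repeats of Step 5
    return [Hold(94, 180)] + cycle * passes + [Hold(68, 300)]


def hold_time_seconds(program: List[Hold]) -> int:
    """Sum of programmed hold times, excluding ramping between temperatures."""
    return sum(hold.seconds for hold in program)


program = build_program(60)
total_seconds = hold_time_seconds(program)
```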

The concentration of DNA in each well was determined by dispensing 100 µl PICOGREEN 
quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in 1X TE 
and 0.5 µl of undiluted PCR product into each well of an opaque fluorimeter plate (Corning Costar, 
Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fluoroskan II 
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the 
concentration of DNA. A 5 µl to 10 µl aliquot of the reaction mixture was analyzed by 
electrophoresis on a 1% agarose mini-gel to determine which reactions were successful in extending 
the sequence. 

The extended nucleotides were desalted and concentrated, transferred to 384-well plates, 
digested with CviJI chlorella virus endonuclease (Molecular Biology Research, Madison WI), and 
sonicated or sheared prior to religation into pUC 18 vector (Amersham Pharmacia Biotech). For 
shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) 
agarose gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended clones 
were religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector (Amersham 
Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in restriction site 
overhangs, and transfected into competent E. coli cells. Transformed cells were selected on 
antibiotic-containing media, and individual colonies were picked and cultured overnight at 37°C in 
384-well plates in LB/2x carb liquid media. 

The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase 
(Amersham Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following 
parameters: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72°C, 2 min; 
Step 5: Steps 2, 3, and 4 repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA was 
quantified by PICOGREEN reagent (Molecular Probes) as described above. Samples with low DNA 
recoveries were reamplified using the same conditions as described above. Samples were diluted 
with 20% dimethylsulfoxide (1:2, v/v), and sequenced using DYENAMIC energy transfer sequencing 
primers and the DYENAMIC DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM 
BIGDYE Terminator cycle sequencing ready reaction kit (Perkin-Elmer). 

In like manner, the nucleotide sequences of SEQ ID NO:30-58 are used to obtain 5' 
regulatory sequences using the procedure above, oligonucleotides designed for such extension, and an 
appropriate genomic library. 

VI. Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO:30-58 are employed to screen cDNAs, 
genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base 
pairs, is specifically described, essentially the same procedure is used with larger nucleotide 
fragments. Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 
software (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 µCi of 
[γ-32P] adenosine triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase 
(DuPont NEN, Boston MA). The labeled oligonucleotides are substantially purified using a 
SEPHADEX G-25 superfine size exclusion dextran bead column (Amersham Pharmacia Biotech). 
An aliquot containing 10^7 counts per minute of the labeled probe is used in a typical membrane-based 
hybridization analysis of human genomic DNA digested with one of the following endonucleases: 
Ase I, Bgl II, Eco RI, Pst I, Xba I, or Pvu II (DuPont NEN). 

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 
membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 
hours at 40°C. To remove nonspecific signals, blots are sequentially washed at room temperature 
under conditions of up to, for example, 0.3x saline sodium citrate and 0.5% sodium dodecyl sulfate. 
Hybridization patterns are visualized using autoradiography or an alternative imaging means and 
compared. 

VII. Microarrays 

A chemical coupling procedure and an ink-jet device can be used to synthesize array 
elements on the surface of a substrate. (See, e.g., Baldeschweiler, supra.) An array analogous to a 
dot or slot blot may also be used to arrange and link elements to the surface of a substrate using 
thermal, UV, chemical, or mechanical bonding procedures. A typical array may be produced by hand 
or using available methods and machines and contain any appropriate number of elements. After 
hybridization, nonhybridized probes are removed and a scanner is used to determine the levels and 
patterns of fluorescence. The degree of complementarity and the relative abundance of each probe 
which hybridizes to an element on the microarray may be assessed through analysis of the scanned 
images. 

Full-length cDNAs, Expressed Sequence Tags (ESTs), or fragments thereof may comprise 
the elements of the microarray. Fragments suitable for hybridization can be selected using software 
well known in the art such as LASERGENE software (DNASTAR). Full-length cDNAs, ESTs, or 
fragments thereof corresponding to one of the nucleotide sequences of the present invention, or 
selected at random from a cDNA library relevant to the present invention, are arranged on an 
appropriate substrate, e.g., a glass slide. The cDNA is fixed to the slide using, e.g., UV cross-linking 
followed by thermal and chemical treatments and subsequent drying. (See, e.g., Schena, M. et al. 
(1995) Science 270:467-470; Shalon, D. et al. (1996) Genome Res. 6:639-645.) Fluorescent probes 
are prepared and used for hybridization to the elements on the substrate. The substrate is analyzed by 
procedures described above. 

VIII. Complementary Polynucleotides 

Sequences complementary to the GTPAP-encoding sequences, or any parts thereof, are used 
to detect, decrease, or inhibit expression of naturally occurring GTPAP. Although use of 
oligonucleotides comprising from about 15 to 30 base pairs is described, essentially the same 
procedure is used with smaller or with larger sequence fragments. Appropriate oligonucleotides are 
designed using OLIGO 4.06 software (National Biosciences) and the coding sequence of GTPAP. To 
inhibit transcription, a complementary oligonucleotide is designed from the most unique 5' sequence 
and used to prevent promoter binding to the coding sequence. To inhibit translation, a 
complementary oligonucleotide is designed to prevent ribosomal binding to the GTPAP-encoding 
transcript. 
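
Deriving the sequence complementary to a chosen target region may be sketched as follows. 
The function name is hypothetical and the example sequence is illustrative only; it is not 
taken from SEQ ID NO:30-58. The sketch handles DNA bases only and makes no attempt to select 
the target region, which the text assigns to the design software. 

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence, written 5' to 3'."""
    if set(seq.upper()) - set("ACGT"):
        raise ValueError("sequence must contain only A, C, G, and T")
    return seq.translate(_COMPLEMENT)[::-1]


# Hypothetical 18-nucleotide target region
target = "ATGGCCTTAGACGGTCAA"
complementary_oligo = reverse_complement(target)
```

Applying the function twice returns the original sequence, which gives a quick check on any 
oligonucleotide derived this way. 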

IX. Expression of GTPAP 

Expression and purification of GTPAP is achieved using bacterial or virus-based expression 
systems. For expression of GTPAP in bacteria, cDNA is subcloned into an appropriate vector 
containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA 
transcription. Examples of such promoters include, but are not limited to, the trp-lac (tac) hybrid 
promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac operator regulatory 
element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3). 
Antibiotic resistant bacteria express GTPAP upon induction with isopropyl 
beta-D-thiogalactopyranoside (IPTG). Expression of GTPAP in eukaryotic cells is achieved by infecting 
insect or mammalian cell lines with recombinant Autographa californica nuclear polyhedrosis virus 
(AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of baculovirus is 
replaced with cDNA encoding GTPAP by either homologous recombination or bacterial-mediated 
transposition involving transfer plasmid intermediates. Viral infectivity is maintained and the strong 
polyhedrin promoter drives high levels of cDNA transcription. Recombinant baculovirus is used to 
infect Spodoptera frugiperda (Sf9) insect cells in most cases, or human hepatocytes, in some cases. 
Infection of the latter requires additional genetic modifications to baculovirus. (See Engelhard, E.K. 
et al. (1994) Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 
7:1937-1945.) 

In most expression systems, GTPAP is synthesized as a fusion protein with, e.g., glutathione 
S-transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step, 
affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 
26-kilodalton enzyme from Schistosoma japonicum, enables the purification of fusion proteins on 
immobilized glutathione under conditions that maintain protein activity and antigenicity (Amersham 
Pharmacia Biotech). Following purification, the GST moiety can be proteolytically cleaved from 
GTPAP at specifically engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity 
purification using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman 
Kodak). 6-His, a stretch of six consecutive histidine residues, enables purification on metal-chelate 
resins (QIAGEN). Methods for protein expression and purification are discussed in Ausubel (1995, 
supra, ch. 10 and 16). Purified GTPAP obtained by these methods can be used directly in the 
following activity assay. 

X. Demonstration of GTPAP Activity 

The role of GTPAP can be assayed in vitro by monitoring the mobilization of Ca++ as part of 
the signal transduction pathway. (See, e.g., Grynkiewicz, G. et al. (1985) J. Biol. Chem. 260:3440; 
McColl, S. et al. (1993) J. Immunol. 150:4550-4555; and Aussel, C. et al. (1988) J. Immunol. 
140:215.) The assay requires preloading neutrophils or T cells with a fluorescent dye such as FURA-2. 
Upon binding Ca++, FURA-2 exhibits an absorption shift that can be observed by scanning the 
excitation spectrum between 300 and 400 nm, while monitoring the emission at 510 nm. When the 
cells are exposed to one or more activating stimuli artificially (e.g., anti-CD3 antibody ligation of the 
T cell receptor) or physiologically (e.g., by allogeneic stimulation), Ca++ flux takes place. Ca++ flux 
results from the release of Ca++ from intracellular organelles or from Ca++ entry into the cell through 
activated Ca++ channels. This flux can be observed and quantified by assaying the cells in a 
fluorometer or fluorescence activated cell sorter. Measurements of Ca++ flux are compared between 
cells in their normal state and those preloaded with GTPAP. Increased mobilization attributable to 
increased GTPAP availability results in increased emission. 

Alternatively, GTPAP activity is measured by quantifying the amount of a non-hydrolyzable
GTP analogue, GTPγS, bound over a 10 minute incubation period. Varying amounts of GTPAP are
incubated at 30°C in 50 mM Tris buffer, pH 7.5, containing 1 mM dithiothreitol, 1 mM EDTA and 1 µM
[35S]GTPγS. Samples are passed through nitrocellulose filters and washed twice with a buffer
consisting of 50 mM Tris-HCl, pH 7.8, 1 mM NaN3, 10 mM MgCl2, 1 mM EDTA, 0.5 mM
dithiothreitol, 0.01 mM PMSF, and 200 mM NaCl. The filter-bound counts are measured by liquid
scintillation to quantify the amount of bound [35S]GTPγS. GTPAP activity may also be measured as
the amount of GTP hydrolysed over a 10 minute incubation period at 37°C. GTPAP is incubated in
50 mM Tris-HCl buffer, pH 7.8, containing 1 mM dithiothreitol, 2 mM EDTA, 10 µM [α-32P]GTP, and
1 µM H-rab protein. GTPase activity is initiated by adding MgCl2 to a final concentration of 10 mM.
Samples are removed at various time points, mixed with an equal volume of ice-cold 0.5 mM EDTA,
and frozen. Aliquots are spotted onto polyethyleneimine-cellulose thin layer chromatography plates,
which are developed in 1 M LiCl, dried, and autoradiographed. The signal detected is proportional to
GTPAP activity.
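
The two readouts above reduce to simple arithmetic, sketched below under stated assumptions. The function names, the background subtraction, the specific activity (counts per pmol), and the 50 µL reaction volume are hypothetical and are not given in the text. Only the 10 µM GTP concentration is taken from the assay description.

```python
def bound_pmol(filter_cpm, background_cpm, cpm_per_pmol):
    """Convert filter-bound scintillation counts to pmol of bound [35S]GTPγS.

    background_cpm : counts from a filter without GTPAP (assumed control)
    cpm_per_pmol   : specific activity of the labeled nucleotide (assumed known)
    """
    if cpm_per_pmol <= 0:
        raise ValueError("specific activity must be positive")
    net_cpm = max(filter_cpm - background_cpm, 0.0)
    return net_cpm / cpm_per_pmol


def fraction_hydrolyzed(gdp_signal, gtp_signal):
    """Fraction of GTP converted to GDP, from the two TLC spot intensities."""
    total = gdp_signal + gtp_signal
    if total <= 0:
        raise ValueError("total signal must be positive")
    return gdp_signal / total


def gtp_hydrolyzed_pmol(gdp_signal, gtp_signal, gtp_um=10.0, volume_ul=50.0):
    """pmol of GTP hydrolyzed; concentration in µM times volume in µL gives pmol."""
    return fraction_hydrolyzed(gdp_signal, gtp_signal) * gtp_um * volume_ul


if __name__ == "__main__":
    # Illustrative values only.
    print(bound_pmol(5200.0, 200.0, 1000.0))
    print(gtp_hydrolyzed_pmol(250.0, 750.0))
```

Because both quantities scale linearly with the measured signal, they preserve the proportionality to GTPAP activity stated above.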

Alternatively, GTPAP activity may be demonstrated as the ability to interact with its
associated Gα or LMW GTPase in an in vitro binding assay. The candidate GTPases are expressed
as fusion proteins with glutathione S-transferase (GST), and purified by affinity chromatography on
glutathione-Sepharose. The GTPases are loaded with GDP by incubation in 20 mM Tris buffer, pH 8.0,
containing 100 mM NaCl, 2 mM EDTA, 5 mM MgCl2, 0.2 mM DTT, 100 µM AMP-PNP and 10 µM
GDP at 30°C for 20 minutes. GTPAP is expressed as a FLAG fusion protein in a baculovirus system.
Extracts of these baculovirus cells containing GTPAP-FLAG fusion proteins are precleared with GST
beads, then incubated with GST-GTPase fusion proteins. The complexes formed are precipitated by
glutathione-Sepharose and separated by SDS-polyacrylamide gel electrophoresis. The separated
proteins are blotted onto nitrocellulose membranes and probed with commercially available
anti-FLAG antibodies. GTPAP activity is proportional to the amount of GTPAP-FLAG fusion protein
detected in the complex.

XI. Functional Assays 

GTPAP function is assessed by expressing the sequences encoding GTPAP at
physiologically elevated levels in mammalian cell culture systems. cDNA is subcloned into a
mammalian expression vector containing a strong promoter that drives high levels of cDNA
expression. Vectors of choice include pCMV SPORT (Life Technologies) and pCR3.1 (Invitrogen,
Carlsbad CA), both of which contain the cytomegalovirus promoter. 5-10 µg of recombinant vector
are transiently transfected into a human cell line, for example, an endothelial or hematopoietic cell
line, using either liposome formulations or electroporation. 1-2 µg of an additional plasmid
containing sequences encoding a marker protein are co-transfected. Expression of a marker protein
provides a means to distinguish transfected cells from nontransfected cells and is a reliable predictor
of cDNA expression from the recombinant vector. Marker proteins of choice include, e.g., Green
Fluorescent Protein (GFP; Clontech), CD64, or a CD64-GFP fusion protein. Flow cytometry (FCM),
an automated, laser optics-based technique, is used to identify transfected cells expressing GFP or
CD64-GFP and to evaluate the apoptotic state of the cells and other cellular properties. FCM detects
and quantifies the uptake of fluorescent molecules that diagnose events preceding or coincident with
cell death. These events include changes in nuclear DNA content as measured by staining of DNA
with propidium iodide; changes in cell size and granularity as measured by forward light scatter and
90 degree side light scatter; down-regulation of DNA synthesis as measured by decrease in
bromodeoxyuridine uptake; alterations in expression of cell surface and intracellular proteins as
measured by reactivity with specific antibodies; and alterations in plasma membrane composition as
measured by the binding of fluorescein-conjugated Annexin V protein to the cell surface. Methods in
flow cytometry are discussed in Ormerod, M.G. (1994) Flow Cytometry, Oxford, New York NY.
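
The gating logic implied above, in which transfected cells are first identified by marker fluorescence and then scored for an apoptosis readout, can be sketched as follows. This is a simplified illustration and not the analysis software of any cytometer. The event representation, the function name, and both thresholds are hypothetical.

```python
def apoptotic_fraction(events, gfp_threshold, annexin_threshold):
    """Fraction of marker-positive (transfected) cells that are Annexin V positive.

    events : iterable of (gfp_intensity, annexin_intensity) pairs, one per cell
    Returns None when no event passes the GFP gate.
    """
    # Gate 1: keep only cells expressing the co-transfected marker.
    transfected = [(gfp, annexin) for gfp, annexin in events if gfp >= gfp_threshold]
    if not transfected:
        return None
    # Gate 2: score Annexin V binding within the transfected population.
    positive = sum(1 for _, annexin in transfected if annexin >= annexin_threshold)
    return positive / len(transfected)


if __name__ == "__main__":
    # Illustrative intensities in arbitrary units.
    cells = [(900, 50), (1200, 800), (1500, 900), (100, 950), (2000, 100)]
    print(apoptotic_fraction(cells, gfp_threshold=500, annexin_threshold=500))
```

Comparing this fraction between cells transfected with GTPAP-encoding vector and cells transfected with empty vector would be one way to summarize the effect on the apoptotic state.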

The influence of GTPAP on gene expression can be assessed using highly purified
populations of cells transfected with sequences encoding GTPAP and either CD64 or CD64-GFP.
CD64 and CD64-GFP are expressed on the surface of transfected cells and bind to conserved regions
of human immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected
cells using magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Lake
Success NY). mRNA can be purified from the cells using methods well known by those of skill in
the art. Expression of mRNA encoding GTPAP and other genes of interest can be analyzed by
northern analysis or microarray techniques.
XII. Production of GTPAP Specific Antibodies

GTPAP substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g.,
Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used to
immunize rabbits and to produce antibodies using standard protocols.

Alternatively, the GTPAP amino acid sequence is analyzed using LASERGENE software
(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is
synthesized and used to raise antibodies by means known to those of skill in the art. Methods for
selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions, are well
described in the art. (See, e.g., Ausubel, 1995, supra, ch. 11.)
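
As a rough illustration of selecting a hydrophilic region, the sketch below slides a window along a one-letter amino acid sequence and reports the window with the lowest mean Kyte-Doolittle hydropathy. This is not the LASERGENE algorithm, whose method is not described here. The Kyte-Doolittle scale is swapped in as a common stand-in, and the function name and example sequence are hypothetical.

```python
# Kyte-Doolittle hydropathy values; more negative means more hydrophilic.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def most_hydrophilic_window(sequence, window=15):
    """Return (start_index, peptide, mean_hydropathy) of the most hydrophilic window.

    start_index is 0-based. The default window of 15 mirrors the oligopeptide
    length of about 15 residues mentioned in the text.
    """
    sequence = sequence.upper()
    if len(sequence) < window:
        raise ValueError("sequence is shorter than the window")
    best_start, best_score = 0, None
    for start in range(len(sequence) - window + 1):
        peptide = sequence[start:start + window]
        score = sum(KYTE_DOOLITTLE[aa] for aa in peptide) / window
        # Strictly lower score wins, so ties keep the earliest window.
        if best_score is None or score < best_score:
            best_start, best_score = start, score
    return best_start, sequence[best_start:best_start + window], best_score


if __name__ == "__main__":
    # Hypothetical toy sequence, not one of the SEQ ID NOs.
    print(most_hydrophilic_window("LLLLLKDELLLLL", window=3))
```

A candidate chosen this way would still need to be checked for other criteria named above, such as proximity to the C-terminus.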

Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 431A
peptide synthesizer (Perkin-Elmer) using fmoc-chemistry and coupled to KLH (Sigma-Aldrich, St.
Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to increase
immunogenicity. (See, e.g., Ausubel, 1995, supra.) Rabbits are immunized with the oligopeptide-KLH
complex in complete Freund's adjuvant. Resulting antisera are tested for antipeptide and
anti-GTPAP activity by, for example, binding the peptide or GTPAP to a substrate, blocking with 1%
BSA, reacting with rabbit antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG.

XIII. Purification of Naturally Occurring GTPAP Using Specific Antibodies

Naturally occurring or recombinant GTPAP is substantially purified by immunoaffinity
chromatography using antibodies specific for GTPAP. An immunoaffinity column is constructed by
covalently coupling anti-GTPAP antibody to an activated chromatographic resin, such as
CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is
blocked and washed according to the manufacturer's instructions.

Media containing GTPAP are passed over the immunoaffinity column, and the column is
washed under conditions that allow the preferential absorbance of GTPAP (e.g., high ionic strength
buffers in the presence of detergent). The column is eluted under conditions that disrupt
antibody/GTPAP binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such
as urea or thiocyanate ion), and GTPAP is collected.

XIV. Identification of Molecules Which Interact with GTPAP 

GTPAP, or biologically active fragments thereof, are labeled with 125I Bolton-Hunter
reagent. (See, e.g., Bolton A.E. and W.M. Hunter (1973) Biochem. J. 133:529-539.) Candidate
molecules previously arrayed in the wells of a multi-well plate are incubated with the labeled GTPAP,
washed, and any wells with labeled GTPAP complex are assayed. Data obtained using different
concentrations of GTPAP are used to calculate values for the number, affinity, and association of
GTPAP with the candidate molecules.
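
One conventional way to obtain the number of binding sites and the affinity from such concentration series is a Scatchard analysis, sketched below. The text does not specify the calculation, so this is offered only as an example under the assumption of a single class of non-interacting sites. The function name and the data are hypothetical.

```python
def scatchard_fit(free, bound):
    """Estimate (Kd, Bmax) by linear regression of bound/free against bound.

    Assumes one-site binding, for which B/F = (Bmax - B) / Kd, so the slope
    is -1/Kd and the x-intercept is Bmax (the number of binding sites).
    free, bound : equal-length sequences in consistent concentration units
    """
    if len(free) != len(bound) or len(free) < 2:
        raise ValueError("need at least two paired measurements")
    xs = list(bound)
    ys = [b / f for b, f in zip(bound, free)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    kd = -1.0 / slope
    bmax = intercept * kd
    return kd, bmax


if __name__ == "__main__":
    # Synthetic data generated from Kd = 2 and Bmax = 10 (arbitrary units).
    free_conc = [0.5, 1.0, 2.0, 4.0, 8.0]
    bound_conc = [10.0 * f / (2.0 + f) for f in free_conc]
    print(scatchard_fit(free_conc, bound_conc))
```

With real counts, nonspecific binding would normally be subtracted before the fit, and a nonlinear fit of the binding isotherm is often preferred because the Scatchard transform distorts measurement error.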

Various modifications and variations of the described methods and systems of the invention 
will be apparent to those skilled in the art without departing from the scope and spirit of the invention. 
Although the invention has been described in connection with certain embodiments, it should be 
understood that the invention as claimed should not be unduly limited to such specific embodiments. 
Indeed, various modifications of the described modes for carrying out the invention which are 
obvious to those skilled in molecular biology or related fields are intended to be within the scope of 
the following claims. 
Library Description


This library was constructed using RNA isolated from the wrist synovial membrane tissue of 
a 62-year-old female with rheumatoid arthritis. 


This library was constructed using RNA isolated from brain meningioma tissue removed from a 
35-year-old female during excision of a cerebral meningeal lesion. Pathology indicated a 
benign neoplasm in the right cerebellopontine angle of the brain. Patient history included 
hypothyroidism. Family history included myocardial infarction and breast cancer. 


This library was constructed using RNA isolated from tumor tissue removed from the penis of 
a 64-year-old male during penile amputation. Pathology indicated a fungating invasive grade
4 squamous cell carcinoma involving the inner wall of the foreskin and extending onto the 
glans penis. Patient history included benign neoplasm of the large bowel, atherosclerotic 
coronary artery disease, angina pectoris, gout, and obesity. Family history included 
malignant pharyngeal neoplasm, chronic lymphocytic leukemia, and chronic liver disease. 


This library was constructed using RNA isolated from prostate tumor tissue removed from a 
65-year-old male during a radical prostatectomy. Pathology indicated an adenocarcinoma 
(Gleason grade 2+2). Adenofibromatous hyperplasia was also present. The patient presented
with elevated prostate specific antigen (PSA).


This library was constructed using RNA isolated from Saos-2, a primary osteogenic sarcoma 
cell line (ATCC HTB-85) derived from an 11-year-old Caucasian female. 


This library was constructed from 6.4 million independent clones from a uterine library. 
RNA for these libraries was isolated from uterine myometrial tissue removed from a 41-year- 
old female during a vaginal hysterectomy with dilation and curettage. The endometrium was 
secretory and contained fragments of endometrial polyps. Benign endo- and ectocervical 
mucosa were identified in the endocervix. Pathology for the associated tumor tissue 
indicated uterine leiomyoma. The normalization and hybridization conditions were adapted
from Soares et al. (Proc. Natl. Acad. Sci. USA (1994) 91:9928).


This library was constructed using RNA isolated from brain tumor tissue removed from the 
parietal lobe of a 43-year-old female during excision of a cerebral meningeal lesion. 
Pathology indicated high grade glioma. Family history included acute myocardial infarction, 
atherosclerotic coronary artery disease, benign hypertension, and hyperlipidemia. 


This library was constructed using RNA isolated from the bone marrow of 24 male and female 
Caucasian donors, 16 to 70 years old. (RNA came from Clontech.) 
1. A substantially purified polypeptide comprising an amino acid sequence selected
from the group consisting of SEQ ID NO: 1-29 and fragments thereof. 

2. A substantially purified variant having at least 90% amino acid sequence identity to 
the amino acid sequence of claim 1. 

3. An isolated and purified polynucleotide encoding the polypeptide of claim 1. 

4. An isolated and purified polynucleotide variant having at least 90% polynucleotide 
sequence identity to the polynucleotide of claim 3. 

5. An isolated and purified polynucleotide which hybridizes under stringent conditions
to the polynucleotide of claim 3.

6. An isolated and purified polynucleotide having a sequence which is complementary 
to the polynucleotide of claim 3. 

7. A method for detecting a polynucleotide, the method comprising the steps of:

(a) hybridizing the polynucleotide of claim 6 to at least one nucleic acid in a 
sample, thereby forming a hybridization complex; and 

(b) detecting the hybridization complex, wherein the presence of the 
hybridization complex correlates with the presence of the polynucleotide in the sample. 

8. The method of claim 7 further comprising amplifying the polynucleotide prior to 
hybridization. 

9. An isolated and purified polynucleotide comprising a polynucleotide sequence
selected from the group consisting of SEQ ID NO:30-58 and fragments thereof.

10. An isolated and purified polynucleotide variant having at least 90% polynucleotide 
sequence identity to the polynucleotide of claim 9. 
11. An isolated and purified polynucleotide having a sequence which is complementary
to the polynucleotide of claim 9. 



12. An expression vector comprising at least a fragment of the polynucleotide of claim 3. 

13. A host cell comprising the expression vector of claim 12. 

14. A method for producing a polypeptide, the method comprising the steps of: 

a) culturing the host cell of claim 13 under conditions suitable for the
expression of the polypeptide; and

b) recovering the polypeptide from the host cell culture. 

15. A pharmaceutical composition comprising the polypeptide of claim 1 in conjunction 
with a suitable pharmaceutical carrier. 

16. A purified antibody which specifically binds to the polypeptide of claim 1.

17. A purified agonist of the polypeptide of claim 1.

18. A purified antagonist of the polypeptide of claim 1.

19. A method for treating or preventing a disorder associated with decreased expression 
or activity of GTPAP, the method comprising administering to a subject in need of such treatment an 
effective amount of the pharmaceutical composition of claim 15. 

20. A method for treating or preventing a disorder associated with increased expression 
or activity of GTPAP, the method comprising administering to a subject in need of such treatment an 
effective amount of the antagonist of claim 18.
DECLARATION AND POWER OF ATTORNEY FOR
UNITED STATES PATENT APPLICATION 



As a below named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated below next to my name, 

and 

I believe that I am the original, first and sole inventor (if only one name is listed below) 
or an original, first and joint inventor (if more than one name is listed below) of the subject 
matter which is claimed and for which a United States patent is sought on the invention entitled 

GTPASE ASSOCIATED PROTEINS 



the specification of which: 
/ / is attached hereto. 

/ / was filed on as application Serial No. and if this box 

contains an X / /, was amended on . 

/X/ was filed as Patent Cooperation Treaty international application No. PCT/US99/28013
on 23 November 1999, if this box contains an X /_/, was amended on under Patent Cooperation
Treaty Article 19 on 2001, and if this box contains an X /_/, was amended on



I hereby state that I have reviewed and understand the contents of the above-identified 
specification, including the claims, as amended by any amendment referred to above. 

I acknowledge my duty to disclose information which is material to the examination of 
this application in accordance with Title 37, Code of Federal Regulations, § 1.56(a). 

I hereby claim the benefit under Title 35, United States Code, § 119 or § 365(a)-(b) of any
foreign application(s) for patent or inventor's certificate indicated below and of any Patent 
Cooperation Treaty international applications(s) designating at least one country other than the 
United States indicated below and have also identified below any foreign application(s) for 
patent or inventor's certificate and Patent Cooperation Treaty international application(s) 
designating at least one country other than the United States for the same subject matter and 
having a filing date before that of the application for said subject matter the priority of which is 
Treaty international filing date of this application:
punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United States
Code and that such willful false statements may jeopardize the validity of the application or any 
patent issuing thereon. 
<110> INCYTE PHARMACEUTICALS, INC.
HILLMAN, Jennifer L.
TANG, Y. Tom
BANDMAN, Olga
LAL, Preeti
YUE, Henry
LU, Dyung Aina M.
BAUGHN, Mariah R.
YANG, Junming
AZIMZAI, Yalda

<120> GTPASE ASSOCIATED PROTEINS

<130> PF-0629 PCT

<140> To Be Assigned

<141> Herewith

<150> 60/109,592; 60/118,610; 60/127,990

<151> 1998-11-23; 1999-02-04; 1999-04-06

<160> 58

<170> PERL Program
Cys Phe Glu Ser Phe Leu Val Val Arg Gly Leu Asp Met Glu Thr
20 25 30

Asp Arg Glu Arg Leu Arg Thr Ile Tyr Asn Arg Asp Phe Lys Ile
35 40 45

Ser Phe Gly Thr Pro Ala Pro Gly Phe Ser Ser Met Leu Tyr Gly
50 55 60

Arg Arg Met Lys Leu Gly Ser Asp Ile Ser Lys His His Lys Ser
95 100 105

Leu Leu Ala Lys Ile Phe Tyr Asp Arg Ala Glu Tyr Leu His Gly
110 115 120

Lys His Gly Val Asp Val Glu Val Gln Gly Pro His Glu Ala Arg

225

Ser Glu Gly Ala Gly Thr Phe Tyr Ile Ala Arg Phe Leu Ala Ala
230 235 240

Val Ala His Ser Pro Leu Ala Ala Gln Leu Lys Pro Met Thr Pro
245 250 255

Phe Lys Arg Thr Arg Ile Thr Gly Asn Pro Val Val Thr Asn Arg
260 265 270

Ile Glu Glu Gly Glu Arg Pro Asp Arg Ala Lys Gly Tyr Asp Leu
275 280 285

Glu Leu Ser Met Ala Leu Gly Thr Tyr Tyr Pro Pro Pro Arg Leu
290 295 300

Arg Gln Leu Leu Pro Met Leu Leu Gln Gly Thr Ser Ile Phe Thr
305 310 315

Ala Pro Lys Glu Ile Ala Glu Ile Lys Ala Gln Leu Glu Thr Ala
320 325 330

Leu Lys Trp Arg Asn Tyr Glu Val Lys Leu Arg Leu Leu Leu His
335 340 345

Leu Glu Glu Leu Gln Met Glu His Asp Ile Arg His Tyr Asp Leu
350 355 360

Glu Ser Val Pro Met Thr Trp Asp Pro Val Asp Gln Asn Pro Arg
365 370 375

Leu Leu Thr Leu Glu Val Pro Gly Val Thr Glu Ser Arg Pro Ser
380 385 390

Val Leu Arg Gly Asp His Leu Phe Ala Leu Leu Ser Ser Glu Thr
395 400 405

His Gln Glu Asp Pro Ile Thr Tyr Lys Gly Phe Val His Lys Val
410 415 420

Glu Leu Asp Arg Val     Leu Ser Phe Ser Met Ser Leu Leu Ser
425 430 435

Arg Phe Val Asp Gly Leu Thr Phe Lys Val Asn Phe Thr Phe Asn
440 445 450

Arg Gln Pro Leu Arg Val Gln His Arg Ala Leu Glu Leu Thr Gly
455 460 465

Arg Trp Leu Leu Trp Pro Met Leu Phe Pro Val Ala Pro Arg Asp
470 475 480

Val Pro Leu Leu Pro Ser Asp Val Lys Leu Lys Leu Tyr Asp Arg
485 490 495

Ser Leu Glu Ser Asn Pro Glu Gln Leu Gln Ala Met Arg His Ile
500 505 510

Val Thr Gly Thr Thr Arg Pro Ala Pro Tyr Ile Ile Phe Gly Pro
515 520 525

Pro Gly Thr Gly Lys Thr Val Thr Leu Val Glu Ala Ile Lys Gln
530 535 540

Val Val Lys His Leu Pro Lys Ala His Ile Leu Ala Cys Ala Pro
545 550 555

Ser Asn Ser Gly Ala Asp Leu Leu Cys Gln Arg Leu Arg Val His
560 565 570

Leu Pro Ser Ser Ile Tyr Arg Leu Leu Ala Pro Ser Arg Asp Ile
575 580 585

Arg Met Val Pro Glu Asp Ile Lys Pro Cys Cys Asn Trp Asp Ala
590 595 600

Lys Lys Gly Glu Tyr Val Phe Pro Ala Lys Lys Lys Leu Gln Glu
605 610 615

Tyr Arg Val Leu Ile Thr Thr Leu Ile Thr Ala Gly Arg Leu Val
620 625 630

Ser Ala Gln Phe Pro Ile Asp His Phe Thr His Ile Phe Ile Asp
635 640 645

Glu Ala Gly His Cys Met Glu Pro Glu Ser Leu Val Ala Ile Ala
650 655 660

Gly Leu Met Glu Val Lys Glu Thr Gly Asp Pro Gly Gly Gln Leu
665 670 675

Val Leu Ala Gly Asp Pro Arg Gln Leu Gly Pro Val Leu Arg Ser
680 685 690

Pro Leu Thr Gln Lys His Gly Leu Gly Tyr Ser Leu Leu Glu Arg
695 700 705

Leu Leu Ile Tyr Asn Ser Leu Tyr Lys Lys Gly Pro Asp Gly Tyr
710 715 720

Asp Pro Gln Phe Ile Thr Lys Leu Leu Arg Asn Tyr Arg Ser His
725 730 735

Pro Thr Ile Leu Asp Ile Pro Asn Gln Leu Tyr Tyr Glu Gly Glu
740 745 750

Leu Gln Ala Cys Ala Asp Val Val Asp Arg Glu Arg Phe Cys Arg
755 760 765

Trp Ala Gly Leu Pro Arg Gln Gly Phe Pro Ile Ile Phe His Gly
770 775 780

Val Met Gly Lys Asp Glu Arg Glu Gly Asn Ser Pro Ser Phe Phe
785 790 795

Asn Pro Glu Glu Ala Ala Thr Val Thr Ser Tyr Leu Lys Leu Leu
800 805 810

Leu Ala Pro Ser Ser Lys Lys Gly Lys Ala Arg Leu Ser Pro Arg
815 820 825

Ser Val Gly Val Ile Ser Pro Tyr Arg Lys Gln Val Glu Lys Ile
830 835 840

Arg Tyr Cys Ile Thr Lys Leu Asp Arg Glu Leu Arg Gly Leu Asp
845 850 855

Asp Ile Lys Asp Leu Lys Val Gly Ser Val Glu Glu Phe Gln Gly
860 865 870

Gln Glu Arg Ser Val Ile Leu Ile Ser Thr Val Arg Ser Ser Gln
875 880 885

Ser Phe Val Gln Leu Asp Leu Asp Phe Asn Leu Gly Phe Leu Lys
890 895 900

Asn Pro Lys Arg Phe Asn Val Ala Val Thr Arg Ala Lys Ala Leu
905 910 915

Leu Ile Ile Val Gly Asn Pro Leu Leu Leu Gly His Asp Pro Asp
920 925 930

Trp Lys Val Phe Leu Glu Phe Cys Lys Glu Asn Gly Gly Tyr Thr
935 940 945

Gly Cys Pro Phe Pro Ala Lys Leu Asp Leu Gln Gln Gly Gln Asn
950 955 960

Leu Leu Gln Gly Leu Ser Lys Leu Ser Pro Ser Thr Ser Gly Pro
965 970 975

His Ser His Asp Tyr Leu Pro Gln Glu Arg Glu Gly Glu Gly Gly
980 985 990

Leu Ser Leu Gln Val Glu Pro Glu Trp Arg Asn Glu
995 1000


<210> 2
<211> 338
<212> PRT
<213> Homo sapiens
<220>
<221> misc_feature
<223> Incyte ID No: 1259937CD1
<400> 2

Met Ala Ala Leu Ala Gln Glu Asp Gly Trp Thr Lys Gly Gln Val
1 5 10 15

Leu Val Lys Val Asn Ser Ala Gly Asp Ala Ile Gly Leu Gln Pro
20 25 30

Asp Ala Arg Gly Val Ala Thr Ser Leu Gly Leu Asn Glu Arg Leu
35 40 45

Phe Val Val Asn Pro Gln Glu Val His Glu Leu Ile Pro His Pro
50 55 60

Asp Gln Leu Gly Pro Thr Val Gly Ser Ala Glu Gly Leu Asp Leu
65 70 75

Val Ser Ala Lys Asp Leu Ala Gly Gln Leu Thr Asp His Asp Trp
80 85 90

Ser Leu Phe Asn Ser Ile His Gln Val Glu Leu Ile His Tyr Val
95 100 105

Leu Gly Pro Gln His Leu Arg Asp Val Thr Thr Ala Asn Leu Glu
110 115 120

Arg Phe Met Arg Arg Phe Asn Glu Leu Gln Tyr Trp Val Ala Thr
125 130 135

Glu Leu Cys Leu Cys Pro Val Pro Gly Pro Arg Ala Gln Leu Leu
140 145 150

Arg Lys Phe Ile Lys Leu Ala Ala His Leu Lys Glu Gln Lys Asn
155 160 165

Leu Asn Ser Phe Phe Ala Val Met Phe Gly Leu Ser Asn Ser Ala
170 175 180

Ile Ser Arg Leu Ala His Thr Trp Glu Arg Leu Pro His Lys Val
185 190 195

Arg Lys Leu Tyr Ser Ala Leu Glu Arg Leu Leu Asp Pro Ser Trp
200 205 210

Asn His Arg Val Tyr Arg Leu Ala Leu Ala Lys Leu Ser Pro Pro
215 220 225

Val Ile Pro Phe Met Pro Leu Leu Leu Lys Asp Met Thr Phe Ile
230 235 240

His Glu Gly Asn His Thr Leu Val Glu Asn Leu Ile Asn Phe Glu
245 250 255

Lys Met Arg Met Met Ala Arg Ala Ala Arg Met Leu His His Cys
260 265 270

Arg Ser His Asn Pro Val Pro Leu Ser Pro Leu Arg Ser Arg Val
275 280 285

Ser His Leu His Glu Asp Ser Gln Val Ala Arg Ile Ser Thr Cys
290 295 300

Ser Glu Gln Ser Leu Ser Thr Arg Ser Pro Ala Ser Thr Trp Ala
305 310 315

Tyr Val Gln Gln Leu Lys Val Ile Asp Asn Gln Arg Glu Leu Ser
320 325 330

Arg Leu Ser Arg Glu Leu Glu Pro
335



<210> 3 

<211> 211 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 1452285CD1 



Met Gln Ala Pro His Lys Glu His Leu Tyr     Leu     Val Ile
1 5 10 15

Gly Asp Leu Gly Val Gly Lys Thr Ser Ile Ile Lys Arg Tyr Val
20 25 30

His Gln Asn Phe Ser Ser His Tyr Arg Ala Thr Ile Gly Val Asp
35 40 45

Phe Ala Leu Lys Val Leu His Trp Asp Pro Glu Thr Val Val Arg
50 55 60

Leu Gln Leu Trp Asp Ile Ala Gly Gln Glu Arg Phe Gly     Met
65 70 75

Thr Arg Val Tyr Tyr Arg Glu Ala Met Gly Ala Phe Ile Val Phe
80 85 90

Asp Val Thr Arg Pro Ala Thr Phe Glu Ala Val Ala Lys Trp Lys
95 100 105

Asn         Asp Ser Lys     Ser Leu Pro Asn Gly Lys     Val
110 115 120

Ser Val Val Leu Leu Ala Asn Lys Cys Asp Gln Gly Lys Asp Val
125 130 135

Leu Met Asn Asn Gly Leu Lys Met Asp Gln Phe Cys Lys Glu His
140 145 150

Gly Phe Val Gly Trp Phe Glu Thr Ser Ala Lys Glu     Ile
155 160 165

Ile Asp Glu Ala Ser Arg Cys Leu Val Lys His Ile Leu Ala Asn
170 175 180

Glu Cys Asp Leu Met Glu Ser Ile Glu Pro Asp Val Val Lys Pro
185 190 195

His Leu Thr Ser Thr Lys Val Ala Ser Cys Ser Gly Cys Ala Lys
200 205 210



<210> 4 

<211> 516 



WO 00/31263 



PCT/US99/28013 



<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 1812894CD1 

<400> 4 

Met Glu Thr Met Lys Ala Val Ala Glu Val Ser Glu Ser Thr Lys
1 5 10 15

Ala Glu Ala Val Ala Ala Val Gln Arg Gln Cys Gln Glu Glu Val
20 25 30

Ala Ser Leu Gln Ala Ile Leu Lys Asp Ser Ile Ser Ser Tyr Glu
35 40 45

Ala Gln Ile Thr Ala Leu Lys Gln Glu Arg Gln Gln Gln Gln Gln
50 55 60

Asp Cys Glu Glu Lys Glu Arg Glu Leu Gly Arg Leu Lys Gln Leu
65 70 75

Leu Ser Arg Ala Tyr Pro Leu Asp Ser Leu Glu Lys Gln Met Glu
80 85 90

Lys Ala His Glu Asp Ser Glu Lys Leu Arg Glu Ile Val Leu Pro
95 100 105

Met Glu Lys Glu Ile Glu Glu Leu Lys Ala Lys Leu Leu Arg Ala
110 115 120

Glu Glu Leu Ile Gln Glu Ile Gln Arg Arg Pro Arg His Ala Pro
125 130 135

Ser Leu His Gly Ser Thr Glu Leu Leu Pro Leu Ser Arg Asp Pro
140 145 150

Ser Pro Pro Leu Glu Pro Leu Glu Glu Leu Ser Gly Asp Gly Gly
155 160 165

Pro Ala Ala Glu Ala Phe Ala His Asn Cys Asp Asp Ser Ala Ser
170 175 180

Ile Ser Ser Phe Ser Leu Gly Gly Gly Val Gly Ser Ser Ser Ser
185 190 195

Leu Pro Gln Ser Arg Gln Gly Leu Ser Pro Glu Gln Glu Glu Thr
200 205 210

Ala Ser Leu Val Ser Thr Gly Thr Leu Val Pro Glu Gly Ile Tyr
215 220 225

Leu Pro Pro Pro Gly Tyr Gln Leu Val Pro Asp Thr Gln Trp Glu
230 235 240

Gln Leu Gln Thr Glu Gly Arg Gln Leu Gln Lys Asp Leu Glu Ser
245 250 255

Val Ser Arg Glu Arg Asp Glu Leu Gln Glu Gly Leu Arg Arg Ser
260 265 270

Asn Glu Asp Cys Ala Lys Gln Met Gln Val Leu Leu Ala Gln Val
275 280 285

Gln Asn Ser Glu Gln Leu Leu Arg Thr Leu Gln Gly Thr Val Ser
290 295 300

Gln Ala Gln Glu Arg Val Gln Leu Gln Met Ala Glu Leu Val Thr
305 310 315

Thr His Lys Cys Leu His His Glu Val Lys Arg Leu Asn Glu Glu
320 325 330

Asn Gln Gly Leu Arg Ala Glu Gln Leu Pro Ser Ser Ala Pro Gln
335 340 345

Gly Ser Gln Gln Glu Gln Gly Glu Glu Glu Ser Leu Pro Ser Ser
350 355 360

Val Pro Glu Leu Gln Gln Leu Leu Cys Cys Thr Arg Gln Glu Ala
365 370 375

Arg Ala Arg Leu Gln Ala Gln Glu His Gly Ala Glu Arg Leu Arg
380 385 390

Ile Glu Ile Val Thr Leu Arg Glu Ala Leu Glu Glu Glu Thr Val
395 400 405

Ala Arg Ala Ser Leu Glu Gly Gln Leu Arg Val Gln Arg Glu Glu
410 415 420

Thr Glu Val Leu Glu Ala Ser Leu Cys Ser Leu Arg Thr Glu Met
425 430 435

Glu Arg Val Gln Gln Glu Gln Ser Lys Ala Gln Leu Pro Asp Leu
440 445 450

Leu Ser Glu Gln Arg Ala Lys Val Leu Arg Leu Gln Ala Glu Leu
455 460 465

Glu Thr Ser Glu Gln Val Gln Arg Asp Phe Val Arg Leu Ser Gln
470 475 480

Ala Leu Gln Val Arg Leu Glu Arg Ile Arg Gln Ala Glu Thr Leu
485 490 495

Glu Gln Val Arg Ser Ile Met Asp Glu Ala Pro Leu Thr Asp Val
500 505 510

Arg Asp Ile Lys Asp Thr
515



<210> 5 

<211> 445 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 3074884CD1 

<400> 5 

Met Pro Glu Asp Ala Asp Glu Asn Ala Glu Glu Glu Leu Leu Arg
1 5 10 15

Gly Glu Pro Leu Leu Pro Ala Gly Thr Gln Arg Val Cys Leu Val
20 25 30

His Pro Asp Val Lys Trp Gly Pro Gly Lys Ser Gln Met Thr Arg
35 40 45

Ala Glu Trp Gln Val Ala Glu Ala Thr Ala Leu Val His Thr Leu
50 55 60

Asp Gly Trp Ser Val Val Gln Thr Met Val Val Ser Thr Lys Thr
65 70 75

Pro Asp Arg Lys Leu Ile Phe Gly Lys Gly Asn Phe Glu His Leu
80 85 90

Thr Glu Lys Ile Arg Gly Ser Pro Asp Val Thr Cys Val Phe Leu
95 100 105

Asn Val Glu Arg Met Ala Ala Pro Thr Lys Lys Glu Leu Glu Ala
110 115 120

Ala Trp Gly Val Glu Val Phe Asp Arg Phe Thr Val Val Leu His
125 130 135

Ile Phe Arg Cys Asn Ala Arg Thr Lys Glu Ala Arg Leu Gln Val
140 145 150

Ala Leu Ala Glu Met Pro Leu His Arg Ser Asn Leu Lys Arg Asp
155 160 165

Val Ala His Leu Tyr Arg Gly Val Gly Ser Arg Tyr Ile Met Gly
170 175 180

Ser Gly Glu Ser Phe Met Gln Leu Gln Gln Arg Leu Leu Arg Glu
185 190 195

Lys Glu Ala Lys Ile Arg Lys Ala Leu Asp Arg Leu Arg Lys Lys
200 205 210

Arg His Leu Leu Arg Arg Gln Arg Thr Arg Arg Glu Phe Pro Val
215 220 225

Ile Ser Val Val Gly Tyr Thr Asn Cys Gly Lys Thr Thr Leu Ile
230 235 240

Lys Ala Leu Thr Gly Asp Ala Ala Ile Gln Pro Arg Asp Gln Leu
245 250 255

Phe Ala Thr Leu Asp Val Thr Ala His Ala Gly Thr Leu Pro Ser
260 265 270

Arg Met Thr Val Leu Tyr Val Asp Thr Ile Gly Phe Leu Ser Gln
275 280 285

Leu Pro His Gly Leu Ile Glu Ser Phe Ser Ala Thr Leu Glu Asp
290 295 300

Val Ala His Ser Asp Leu Ile Leu His Val Arg Asp Val Ser His
305 310 315

Pro Glu Ala Glu Leu Gln Lys Cys Ser Val Leu Ser Thr Leu Arg
320 325 330

Gly Leu Gln Leu Pro Ala Pro Leu Leu Asp Ser Met Val Glu Val
335 340 345

His Asn Lys Val Asp Leu Val Pro Gly Tyr Ser Pro Thr Glu Pro
350 355 360

Asn Val Val Pro Val Ser Ala Leu Arg Gly His Gly Leu Gln Glu
365 370 375

Leu Lys Ala Glu Leu Asp Ala Ala Val Leu Lys Ala Thr Gly Arg
380 385 390

Gln Ile Leu Thr Leu Arg Val Arg Leu Ala Gly Ala Gln Leu Ser
395 400 405

Trp Leu Tyr Lys Glu Ala Thr Val Gln Glu Val Asp Val Ile Pro
410 415 420

Glu Asp Gly Ala Ala Asp Val Arg Val Ile Ile Ser Asn Ser Ala
425 430 435

Tyr Gly Lys Phe Arg Lys Leu Phe Pro Gly
440 445



<210> 6 
<211> 445 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 3452277CD1 
<400> 6 

Met Tyr Tyr Gln Gln Ala Leu Met Arg Ser Thr Val Lys Ser Ser
1 5 10 15

Val Ser Leu Gly Gly Ile Val Lys Tyr Ser Glu Gln Phe Ser Ser
20 25 30

Asn Asp Ala Ile Met Ser Gly Cys Leu Pro Ser Asn Pro Trp Ile
35 40 45

Thr Asp Asp Thr Gln Phe Trp Asp Leu Asn Ala Lys Leu Val Glu
50 55 60

Ile Pro Thr Lys Met Arg Val Glu Arg Trp Ala Phe Asn Phe Ser
65 70 75

Glu Leu Ile Arg Asp Pro Lys Gly Arg Gln Ser Phe Gln Tyr Phe
80 85 90

Leu Lys Lys Glu Phe Ser Gly Glu Asn Leu Gly Phe Trp Glu Ala
95 100 105

Cys Glu Asp Leu Lys Tyr Gly Asp Gln Ser Lys Val Lys Glu Lys
110 115 120

Ala Glu Glu Ile Tyr Lys Leu Phe Leu Ala Pro Gly Ala Arg Arg
125 130 135

Trp Ile Asn Ile Asp Gly Lys Thr Met Asp Ile Thr Val Lys Gly
140 145 150

Leu Lys His Pro His Arg Tyr Val Leu Asp Ala Ala Gln Thr His
155 160 165

Ile Tyr Met Leu Met Lys Lys Asp Ser Tyr Ala Arg Tyr Leu Lys
170 175 180

Ser Pro Ile Tyr Lys Asp Met Leu Ala Lys Ala Ile Glu Pro Gln
185 190 195

Glu Thr Thr Lys Lys Ser Ser Thr Leu Pro Phe Met Arg Arg His
200 205 210

Leu Arg Ser Ser Pro Ser Pro Val Ile Leu Arg Gln Leu Glu Glu
215 220 225

Glu Ala Lys Ala Arg Glu Ala Ala Asn Thr Val Asp Ile Thr Gln
230 235 240

Pro Gly Gln His Met Ala Pro Ser Pro His Leu Thr Val Tyr Thr
245 250 255

Gly Thr Cys Met Pro Pro Ser Pro Ser Ser Pro Phe Ser Ser Ser
260 265 270

Cys Arg Ser Pro Arg Lys Pro Phe Ala Ser Pro Ser Arg Phe Ile
275 280 285

Arg Arg Pro Ser Thr Thr Ile Cys Pro Ser Pro Ile Arg Val Ala
290 295 300

Leu Glu Ser Ser Ser Gly Leu Glu Gln Lys Gly Glu Cys Ser Gly
305 310 315

Ser Met Ala Pro Arg Gly Pro Ser Val Thr Glu Ser Ser Glu Ala
320 325 330

Ser Leu Asp Thr Ser Trp Pro Arg Ser Arg Pro Arg Ala Pro Pro
335 340 345

Lys Ala Arg Met Ala Leu Ser Phe Ser Arg Phe Leu Arg Arg Gly
350 355 360

Cys Leu Ala Ser Pro Val Phe Ala Arg Leu Ser Pro Lys Cys Pro
365 370 375

Ala Val Ser His Gly Arg Val Gln Pro Leu Gly Asp Val Gly Gln
380 385 390

Gln Leu Pro Arg Leu Lys Ser Lys Arg Val Ala Asn Phe Phe Gln
395 400 405

Ile Lys Met Asp Val Pro Thr Gly Ser Gly Thr Cys Leu Met Asp
410 415 420

Ser Glu Asp Ala Gly Thr Gly Glu Ser Gly Asp Arg Ala Thr Glu
425 430 435

Lys Glu Val Ile Cys Pro Trp Glu Ser Leu
440 445



<210> 7 

<211> 281 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 4203832CD1 

<400> 7 

Met Lys Leu Ala Ala Met Ile Lys Lys Met Cys Pro Ser Asp Ser
1 5 10 15

Glu Leu Ser Ile Pro Ala Lys Asn Cys Tyr Arg Met Val Ile Leu
20 25 30

Gly Ser Ser Lys Val Gly Lys Thr Ala Ile Val Ser Arg Phe Leu
35 40 45

Thr Gly Arg Phe Glu Asp Ala Tyr Thr Pro Thr Ile Glu Asp Phe
50 55 60

His Arg Lys Phe Tyr Ser Ile Arg Gly Glu Val Tyr Gln Leu Asp
65 70 75

Ile Leu Asp Thr Ser Gly Asn His Pro Phe Pro Ala Met Arg Cys
80 85 90

Leu Ser Ile Leu Thr Gly Asp Val Phe Ile Leu Val Phe Ser Leu
95 100 105

Asp Asn Arg Asp Ser Phe Glu Glu Val Gln Arg Leu Arg Gln Gln
110 115 120

Ile Leu Asp Thr Lys Ser Cys Leu Lys Asn Lys Thr Lys Glu Asn
125 130 135

Val Asp Val Pro Leu Val Ile Cys Gly Asn Lys Gly Asp Arg Asp
140 145 150

Phe Tyr Arg Glu Val Asp Gln Arg Glu Ile Glu Gln Leu Val Gly
155 160 165

Asp Asp Pro Gln Arg Cys Ala Tyr Phe Glu Ile Ser Ala Lys Lys
170 175 180

Asn Ser Ser Leu Asp Gln Met Phe Arg Ala Leu Phe Ala Met Ala
185 190 195

Lys Leu Pro Ser Glu Met Ser Pro Asp Leu His Arg Lys Val Ser
200 205 210

Val Gln Tyr Cys Asp Val Leu His Lys Lys Ala Leu Arg Asn Lys
215 220 225

Lys Leu Leu Arg Ala Gly Ser Gly Gly Gly Gly Gly Asp Pro Gly
230 235 240

Asp Ala Phe Gly Ile Val Ala Pro Phe Ala Arg Arg Pro Ser Val
245 250 255

His Ser Asp Leu Met Tyr Ile Arg Glu Lys Ala Ser Ala Gly Ser
260 265 270

Gln Ala Lys Asp Lys Glu Arg Cys Val Ile Ser
275 280



<210> 8 



<211> 301 
<212> PRT 
<213> Homo sapiens
<220> 

<221> misc_feature 

<223> Incyte ID No: 104368CD1 

<400> 8 



Met Thr Thr Leu Asp Asp Lys Leu Leu Gly Glu Lys Leu Gln Tyr
1 5 10 15

Tyr Tyr Ser Ser Ser Glu Asp Glu Asp Ser Asp His Glu Asp Lys
20 25 30

Asp Arg Gly Arg Cys Ala Pro Ala Ser Ser Ser Val Pro Ala Glu
35 40 45

Ala Glu Leu Ala Gly Glu Gly Ile Ser Val Asn Thr Gly Pro Lys
50 55 60

Gly Val Ile Asn Asp Trp Arg Arg Phe Lys Gln Leu Glu Thr Glu
65 70 75

Gln Arg Glu Glu Gln Cys Arg Glu Met Glu Arg Leu Ile Lys Lys
80 85 90

Leu Ser Met Thr Cys Arg Ser His Leu Asp Glu Glu Glu Glu Gln
95 100 105

Gln Lys Gln Lys Asp Leu Gln Glu Lys Ile Ser Gly Lys Met Thr
110 115 120

Leu Lys Glu Phe Ala Ile Met Asn Glu Asp Gln Asp [illegible] Glu Glu
125 130 135

Phe Leu Gln Gln Tyr Arg Lys Gln Arg Met Glu Glu Met Arg Gln
140 145 150

Gln Leu His Lys Gly Pro Gln Phe Lys Gln Val Phe Glu Ile Ser
155 160 165

Ser Gly Glu Gly Phe Leu Asp Met Ile Asp Lys Glu Gln Lys Ser
170 175 180

Ile Val Ile Met Val His Ile Tyr Glu Asp Gly Ile Pro Gly Thr
185 190 195

Glu Ala Met Asn Gly Cys Met Ile Cys Leu Ala Ala Glu Tyr Pro
200 205 210

Ala Val Lys Phe Cys Lys Val Lys Ser Ser Val Ile Gly Ala Ser
215 220 225

Ser Gln Phe Thr Arg Asn Ala Leu Pro Ala Leu Leu Ile Tyr [illegible]
230 235 240

Gly Gly Glu Leu Ile Gly Asn Phe Val Arg Val Thr Asp Gln Leu
245 250 255

Gly [illegible] Asp Phe Phe Ala Val Asp [illegible] Glu Ala Phe Leu Gln Glu
260 265 270

Phe Gly Leu Leu Pro Glu Lys Glu Val Leu Val Leu Thr Ser Val
275 280 285

Arg Asn Ser Ala Thr Cys His Ser Glu Asp Ser Asp Leu Glu Ile
290 295 300


<210> 9 
<211> 485 
<212> PRT 
<213> Homo sapiens
<220> 

<221> misc_feature 

<223> Incyte ID No: 1441680CD1 

<400> 9 

Met Arg Ala Val Leu Thr Trp Arg Asp Lys Ala Glu His Cys Ile
1 5 10 15

Asn Asp Ile Ala Phe Lys Pro Asp Gly Thr Gln Leu Ile Leu Ala
20 25 30

Ala Gly Ser Arg Leu Leu Val Tyr Asp Thr Ser Asp Gly Thr Leu
35 40 45

Leu Gln Pro Leu Lys Gly His Lys Asp Thr Val Tyr Cys Val Ala
50 55 60

Tyr Ala Lys Asp Gly Lys Arg Phe Ala Ser Gly Ser Ala Asp Lys
65 70 75

Ser Val Ile Ile Trp Thr Ser Lys Leu Glu Gly Ile Leu Lys Tyr
80 85 90

Thr His Asn Asp Ala Ile Gln Cys Val Ser Tyr Asn Pro Ile Thr
95 100 105

His Gln Leu Ala Ser Cys Ser Ser Ser Asp Phe Gly Leu Trp Ser
110 115 120

Pro Glu Gln Lys Ser Val Ser Lys His Lys Ser Ser Ser Lys Ile
125 130 135

Ile Cys Cys Ser Trp Thr Asn Asp Gly Gln Tyr Leu Ala Leu Gly
140 145 150

Met Phe Asn Gly Ile Ile Ser Ile Arg Asn Lys Asn Gly Glu Glu
155 160 165

Lys Val Lys Ile Glu Arg Pro Gly Gly Ser Leu Ser Pro Ile Trp
170 175 180

Ser Ile Cys Trp Asn Pro Ser Arg Glu Glu Arg Asn Asp Ile Leu
185 190 195

Ala Val Ala Asp Trp Gly Gln Lys Val Ser Phe Tyr Gln Leu Ser
200 205 210

Gly Lys Gln Ile Gly Lys Asp Arg Ala Leu Asn Phe Asp Pro Cys
215 220 225

Cys Ile Ser Tyr Phe Thr Lys Gly Glu Tyr Ile Leu Leu Gly Gly
230 235 240

Ser Asp Lys Gln Val Ser Leu Phe Thr Lys Asp Gly Val Arg Leu
245 250 255

Gly Thr Val Gly Glu Gln Asn Ser Trp Val Trp Thr Cys Gln Ala
260 265 270

Lys Pro Asp Ser Asn Tyr Val Val Val Gly Cys Gln Asp Gly Thr
275 280 285

Ile Ser Phe Tyr Gln Leu Ile Phe Ser Thr Val His Gly Val Tyr
290 295 300

Lys Asp Arg Tyr Ala Tyr Arg Asp Ser Met Thr Asp Val Ile Val
305 310 315

Gln His Leu Ile Thr Glu Gln Lys Val Arg Ile Lys Cys Lys Glu
320 325 330

Leu Val Lys Lys Ile Ala Ile Tyr Arg Asn Arg Leu Ala Ile Gln
335 340 345

Leu Pro Glu Lys Ile Leu Ile Tyr Glu Leu Tyr Ser Glu Asp Leu
350 355 360

Ser Asp Met His Tyr Arg Val Lys Glu Lys Ile Ile Lys Lys Phe
365 370 375

Glu Cys Asn Leu Leu Val Val Cys Ala Asn His Ile Ile Leu Cys
380 385 390

Gln Glu Lys Arg Leu Gln Cys Leu Ser Phe Ser Gly Val Lys Glu
395 400 405

Arg Glu Trp Gln Met Glu Ser Leu Ile Arg Tyr Ile Lys Val Ile
410 415 420

Gly Gly Pro Pro Gly Arg Glu Gly Leu Leu Val Gly Leu Lys Lys
425 430 435

Met Tyr Leu Leu Val Tyr Ser Phe Ile Leu Ile Val Lys Asp Tyr
440 445 450

Phe Ser Leu Ser Thr Asp Val Leu Gly Asn Leu Thr Trp Lys His
455 460 465

Val Cys Lys Lys His Tyr Trp Val Phe His Leu Phe Ser Trp Tyr
470 475 480

Tyr Ile Phe Val Gln
485



<210> 10 

<211> 447 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1494955CD1 
<400> 10 

Met Glu Leu Ser Gln Met Ser Glu Leu Met Gly Leu Ser Val Leu
1 5 10 15

Leu Gly Leu Leu Ala Leu Met Ala Thr Ala Ala Val Ala Arg Gly
20 25 30

Trp Leu Arg Ala Gly Glu Glu Arg Ser Gly Arg Pro Ala Cys Gln
35 40 45

Lys Ala Asn Gly Phe Pro Pro Asp Lys Ser Ser Gly Ser Lys Lys
50 55 60

Gln Lys Gln Tyr Gln Arg Ile Arg Lys Glu Lys Pro Gln Gln His
65 70 75

Asn Phe Thr His Arg Leu Leu Ala Ala Ala Leu Lys Ser His Ser
80 85 90

Gly Asn Ile Ser Cys Met Asp Phe Ser Ser Asn Gly Lys Tyr Leu
95 100 105

Ala Thr Cys Ala Asp Asp Arg Thr Ile Arg Ile Trp Ser Thr Lys
110 115 120

Asp Phe Leu Gln Arg Glu His Arg Ser Met Arg Ala Asn Val Glu
125 130 135

Leu Asp His Ala Thr Leu Val Arg Phe Ser Pro Asp Cys Arg Ala
140 145 150

Phe Ile Val Trp Leu Ala Asn Gly Asp Thr Leu Arg Val Phe Lys
155 160 165

Met Thr Lys Arg Glu Asp Gly Gly Tyr Thr Phe Thr Ala Thr Pro
170 175 180

Glu Asp Phe Pro Lys Lys His Lys Ala Pro Val Ile Asp Ile Gly
185 190 195

Ile Ala Asn Thr Gly Lys Phe Ile Met Thr Ala Ser Ser Asp Thr
200 205 210

Thr Val Leu Ile Trp Ser Leu Lys Gly Gln Val Leu Ser Thr Ile
215 220 225

Asn Thr Asn Gln Met Asn Asn Thr His Ala Ala Val Ser Pro Cys
230 235 240

Gly Arg Phe Val Ala Ser Cys Gly Phe Thr Pro Asp Val Lys Val
245 250 255

Trp Glu Val Cys Phe Gly Lys Lys Gly Glu Phe Gln Glu Val Val
260 265 270

Arg Ala Phe Glu Leu Lys Gly His Ser Ala Ala Val His Ser Phe
275 280 285

Ala Phe Ser Asn Asp Ser Arg Arg Met Ala Ser Val Ser Lys Asp
290 295 300

Gly Thr Trp Lys Leu Trp Asp Thr Asp Val Glu Tyr Lys Lys Lys
305 310 315

Gln Asp Pro Tyr Leu Leu Lys Thr Gly Arg Phe Glu Glu Ala Ala
320 325 330

Gly Ala Ala Pro Cys Arg Leu Ala Leu Ser Pro Asn Ala Gln Val
335 340 345

Leu Ala Leu Ala Ser Gly Ser Ser Ile His Leu Tyr Asn Thr Arg
350 355 360

Arg Gly Glu Lys Glu Glu Cys Phe Glu Arg Val His Gly Glu Cys
365 370 375

Ile Ala Asn Leu Ser Phe Asp Ile Thr Gly Arg Phe Leu Ala Ser
380 385 390

Cys Gly Asp Arg Ala Val Arg Leu Phe His Asn Thr Pro Gly His
395 400 405

Arg Ala Met Val Glu Glu Met Gln Gly His Leu Lys Arg Ala Ser
410 415 420

Asn Glu Ser Thr Arg Gln Arg Leu Gln Gln Gln Leu Thr Gln Ala
425 430 435

Gln Glu Thr Leu Lys Ser Leu Gly Ala Leu Lys Lys
440 445



<210> 11 

<211> 199 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1508161CD1 

<400> 11 

Met Pro Val Lys Lys Lys His Arg Ala Arg Met Ile Glu Tyr Phe
1 5 10 15

Ile Asp Val Ala Arg Glu Cys Phe Asn Ile Gly Asn Phe Asn Ser
20 25 30

Leu Met Ala Ile Ile Ser Gly Met Asn Met Ser Pro Val Ser Arg
35 40 45

Leu Lys Lys Thr Trp Ala Lys Val Lys Thr Ala Lys Phe Asp Ile
50 55 60

Leu Glu His Gln Met Asp Pro Ser Ser Asn Phe Tyr Asn Tyr Arg
65 70 75

Thr Ala Leu Arg Gly Ala Ala Gln Arg Ser Leu Thr Ala His Ser
80 85 90

Ser Arg Glu Lys Ile Val Ile Pro Phe Phe Ser Leu Leu Ile Lys
95 100 105

Asp Ile Tyr Phe Leu Asn Glu Gly Cys Ala Asn Arg Leu Pro Asn
110 115 120

Gly His Val Asn Phe Glu Lys Phe Trp Glu Leu Ala Lys Gln Val
125 130 135

Ser Glu Phe Met Thr Trp Lys Gln Val Glu Cys Pro Phe Glu Arg
140 145 150

Asp Arg Lys Ile Leu Gln Tyr Leu Leu Thr Val Pro Val Phe Ser
155 160 165

Glu Asp Ala Leu Tyr Leu Ala Ser Tyr Glu Ser Glu Gly Pro Glu
170 175 180

Asn His Ile Glu Lys Asp Arg Trp Lys Ser Leu Arg Ser Ser Leu
185 190 195

Leu Gly Arg Val



<210> 12 
<211> 694 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 1811877CD1 

<400> 12 

Met Ala Phe Asp Pro Thr Ser Thr Leu Leu Ala Thr Gly Gly Cys
1 5 10 15

Asp Gly Ala Val Arg Val Trp Asp Ile Val Arg His Tyr Gly Thr
20 25 30

His His Phe Arg Gly Ser Pro Gly Val Val His Leu Val Ala Phe
35 40 45

His Pro Asp Pro Thr Arg Leu Leu Leu Phe Ser Ser Ala Thr Asp
50 55 60

Ala Ala Ile Arg Val Trp Ser Leu Gln Asp Arg Ser Cys Leu Ala
65 70 75

Val Leu Thr Ala His Tyr Ser Ala Val Thr Ser Leu Ala Phe Ser
80 85 90

Ala Asp Gly His Thr Met Leu Ser Ser Gly Arg Asp Lys Ile Cys
95 100 105

Ile Ile Trp Asp Leu Gln Ser Cys Gln Ala Thr Arg Thr Val Pro
110 115 120

Val Phe Glu Ser Val Glu Ala Ala Val Leu Leu Pro Glu Glu Pro
125 130 135

Val Ser Gln Leu Gly Val Lys Ser Pro Gly Leu Tyr Phe Leu Thr
140 145 150

Ala Gly Asp Gln Gly Thr Leu Arg Val Trp Glu Ala Ala Ser Gly
155 160 165

Gln Cys Val Tyr Thr Gln Ala Gln Pro Pro Gly Pro Gly Gln Glu
170 175 180

Leu Thr His Cys Thr Leu Ala His Thr Ala Gly Val Val Leu Thr
185 190 195

Ala Thr Ala Asp His Asn Leu Leu Leu Tyr Glu Ala Arg Ser Leu
200 205 210

Arg Leu Gln Lys Gln Phe Ala Gly Tyr Ser Glu Glu Val Leu Asp
215 220 225

Val Arg Phe Leu Gly Pro Glu Asp Ser His Val Val Val Ala Ser
230 235 240

Asn Ser Pro Cys Leu Lys Val Phe Glu Leu Gln Thr Ser Ala Cys
245 250 255

Gln Ile Leu His Gly His Thr Asp Ile Val Leu Ala Leu Asp Val
260 265 270

Phe Arg Lys Gly Trp Leu Phe Ala Ser Cys Ala Lys Asp Gln Ser
275 280 285

Val Arg Ile Trp Arg Met Asn Lys Ala Gly Gln Val Met Cys Val
290 295 300

Ala Gln Gly Ser Gly His Thr His Ser Val Gly Thr Val Cys Cys
305 310 315

Ser Arg Leu Lys Glu Ser Phe Leu Val Thr Gly Ser Gln Asp Cys
320 325 330

Thr Val Lys Leu Trp Pro Leu Pro Lys Ala Leu Leu Ser Lys Asn
335 340 345

Thr Ala Pro Asp Asn Gly Pro Ile Leu Leu Gln Ala Gln Thr Thr
350 355 360

Gln Arg Cys His Asp Lys Asp Ile Asn Ser Val Ala Ile Ala Pro
365 370 375

Asn Asp Lys Leu Leu Ala Thr Gly Ser Gln Asp Arg Thr Ala Lys
380 385 390

Leu Trp Ala Leu Pro Gln Cys Gln Leu Leu Gly Val Phe Ser Gly
395 400 405

His Arg Arg Gly Leu Trp Cys Val Gln Phe Ser Pro Met Asp Gln
410 415 420

Val Leu Ala Thr Ala Ser Ala Asp Gly Thr Ile Lys Leu Trp Ala
425 430 435

Leu Gln Asp Phe Ser Cys Leu Lys Thr Phe Glu Gly His Asp Ala
440 445 450

Ser Val Leu Lys Val Ala Phe Val Ser Arg Gly Thr Gln Leu Leu
455 460 465

Ser Ser Gly Ser Asp Gly Leu Val Lys Leu Trp Thr Ile Lys Asn
470 475 480

Asn Glu Cys Val Arg Thr Leu Asp Ala His Glu Asp Lys Val Trp
485 490 495

Gly Leu His Cys Ser Arg Leu Asp [illegible] His Ala [illegible] Thr Gly Ala
500 505 510

Ser Asp Ser Arg Val Ile Leu Trp Lys Asp Val Thr Glu Ala Glu
515 520 525

Gln Ala Glu Glu Gln Ala Arg Gln Glu Glu Gln Val Val Arg Gln
530 535 540

Gln Glu Leu Asp Asn Leu Leu His Glu Lys [illegible] Tyr Leu [illegible] [illegible]
545 550 555

Leu Gly Leu Ala Ile Ser Leu Asp Arg Pro His Thr Val Leu Thr
560 565 570

Val Ile Gln Ala Ile Arg Arg Asp Pro Glu Ala Cys Glu Lys Leu
575 580 585

Glu Ala Thr Met Leu Arg Leu Arg Arg Asp Gln Lys Glu Ala Leu
590 595 600

Leu Arg Phe Cys Val Thr Trp Asn Thr Asn Ser Arg His Cys His
605 610 615

Glu Ala Gln Ala Val Leu Gly Val Leu Leu Arg Arg Glu Ala Pro
620 625 630

Glu Glu Leu Leu Ala Tyr Glu Gly Val Arg Ala Ala Leu Glu Ala
635 640 645

Leu Leu Pro Tyr Thr Glu Arg His Phe Gln Arg Leu Ser Arg Thr
650 655 660

Leu Gln Ala Ala Ala Phe Leu Asp Phe Leu Trp His Asn Met Lys
665 670 675

Leu Pro Val Pro Ala Ala Ala Pro Thr Pro Trp Glu Thr His Lys
680 685 690

Gly Ala Leu Pro



<210> 13 

<211> 654 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1848674CD1 

<400> 13 

Met Glu Arg Ser Gly Pro Ser Glu Val Thr Gly Ser Asp Ala Ser
1 5 10 15

Gly Pro Asp Pro Gln Leu Ala Val Thr Met Gly Phe Thr Gly Phe
20 25 30

Gly Lys Lys Ala Arg Thr Phe Asp Leu Glu Ala Met Phe Glu Gln
35 40 45

Thr Arg Arg Thr Ala Val Glu Arg Ser Arg Lys Thr Leu Glu Ala
50 55 60

Arg Glu Lys Glu Glu Glu Met Asn Arg Glu Lys Glu Leu Arg Arg
65 70 75

Gln Asn Glu Asp Ile Glu Pro Thr Ser Ser Arg Ser Asn Val Val
80 85 90

Arg Asp Cys Ser Lys Ser Ser Ser Arg Asp Thr Ser Ser Ser Glu
95 100 105

Ser Glu Gln Ser Ser Asp Ser Ser Asp Asp Glu Leu Ile Gly Pro
110 115 120

Pro Leu Pro Pro Lys Met Val Gly Lys Pro Val Asn Phe Met Glu
125 130 135

Glu Asp Ile Leu Gly Pro Leu Pro Pro Pro Leu Asn Glu Glu Glu
140 145 150

Glu Glu Ala Glu Glu Glu Glu Glu Glu Glu Glu Glu Glu Glu Asn
155 160 165

Pro Val His Lys Ile Pro Asp Ser His Glu Ile Thr Leu Lys His
170 175 180

Gly Thr Lys Thr Val Ser Ala Leu Gly Leu Asp Pro Ser Gly Ala
185 190 195

Arg Leu Val Thr Gly Gly Tyr Asp Tyr Asp Val Lys Phe Trp Asp
200 205 210

Phe Ala Gly Met Asp Ala Ser Phe Lys Ala Phe Arg Ser Leu Gln
215 220 225

Pro Cys Glu Cys His Gln Ile Lys Ser Leu Gln Tyr Ser Asn Thr
230 235 240

Gly Asp Met Ile Leu Val Val Ser Gly Ser Ser Gln Ala Lys Val
245 250 255

Ile Asp Arg Asp Gly Phe Glu Val Met Glu Cys Ile Lys Gly Asp
260 265 270

Gln Tyr Ile Val Asp Met Ala Asn Thr Lys Gly His Thr Ala Met
275 280 285

Leu His Thr Gly Ser Trp His Pro Lys Ile Lys Gly Glu Phe Met
290 295 300

Thr Cys Ser Asn Asp Ala Thr Val Arg Thr Trp Glu Val Glu Asn
305 310 315

Pro Lys Lys Gln Lys Ser Val Phe Lys Pro Arg Thr Met Gln Gly
320 325 330

Lys Lys Val Ile Pro Thr Thr Cys Thr Tyr Ser Arg Asp Gly Asn
335 340 345

Leu Ile Ala Ala Ala Cys Gln Asn Gly Ser Ile Gln Ile Trp Asp
350 355 360

Arg Asn Leu Thr Val His Pro Lys Phe His Tyr Lys Gln Ala His
365 370 375

Asp Ser Gly Thr Asp Thr Ser Cys Val Thr Phe Ser Tyr Asp Gly
380 385 390

Asn Val Leu Ala Ser Arg Gly Gly Asp Asp Ser Leu Lys Leu Trp
395 400 405

Asp Ile Arg Gln Phe Asn Lys Pro Leu Phe Ser Ala Ser Gly Leu
410 415 420

Pro Thr Met Phe Pro Met Thr Asp Cys Cys Phe Ser Pro Asp Asp
425 430 435

Lys Leu Ile Val Thr Gly Thr Ser Ile Gln Arg Gly Cys Gly Ser
440 445 450

Gly Lys Leu Val Phe Phe Glu Arg Arg Thr Phe Gln Arg Val Tyr
455 460 465

Glu Ile Asp Ile Thr Asp Ala Ser Val Val Arg Cys Leu Trp His
470 475 480

Pro Lys Leu Asn Gln Ile Met Val Gly Thr Gly Asn Gly Leu Ala
485 490 495

Lys Val Tyr Tyr Asp Pro Asn Lys Ser Gln Arg Gly Ala Lys Leu
500 505 510

Cys Val Val Lys Thr Gln Arg Lys Ala Lys Gln Ala Glu Thr Leu
515 520 525

Thr Gln Asp Tyr Ile Ile Thr Pro His Ala Leu Pro Met Phe Arg
530 535 540

Glu Pro Arg Gln Arg Ser Thr Arg Lys Gln Leu Glu Lys Asp Arg
545 550 555

Leu Asp Pro Leu Lys Ser His Lys Pro Glu Pro Pro Val Ala Gly
560 565 570

Pro Gly Arg Gly Gly Arg Val Gly Thr His Gly Gly Thr Leu Ser
575 580 585

Ser Tyr Ile Val Lys Asn Ile Ala Leu Asp Lys Thr Asp Asp Ser
590 595 600

Asn Pro Arg Glu Ala Ile Leu Arg His Ala Lys Ala Ala Glu Asp
605 610 615

Ser Pro Tyr Trp Val Ser Pro Ala Tyr Ser Lys Thr Gln Pro Lys
620 625 630

Thr Met Phe Ala Gln Val Glu Ser Asp Asp Glu Glu Ala Lys Asn
635 640 645

Glu Pro Glu Trp Lys Lys Arg Lys Ile
650



<210> 14 
<211> 180 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2012970CD1

<400> 14 

Met Glu Ala Asn Met Pro Lys Arg Lys Glu Pro Gly Arg Ser Leu
1 5 10 15

Arg Ile Lys Val Ile Ser Met Gly Asn Ala Glu Val Gly Lys Ser
20 25 30

Cys Ile Ile Lys Arg Tyr Cys Glu Lys Arg Phe Val Ser Lys Tyr
35 40 45

Leu Ala Thr Ile Gly Ile Asp Tyr Gly Val Thr Lys Val His Val
50 55 60

Arg Asp Arg Glu Ile Lys Val Asn Ile Phe Asp Met Ala Gly His
65 70 75

Pro Phe Phe Tyr Glu Val Arg Asn Glu Phe Tyr Lys Asp Thr Gln
80 85 90

Gly Val Ile Leu Val Tyr Asp Val Gly Gln Lys Asp Ser Phe Asp
95 100 105

Ala Leu Asp Ala Trp Leu Ala Glu Met Lys Gln Glu Leu Gly Pro
110 115 120

His Gly Asn Met Glu Asn Ile Ile Phe Val Val Cys Ala Asn Lys
125 130 135

Ile Asp Cys Thr Lys His Arg Cys Val Asp Glu Ser Glu Gly Arg
140 145 150

Leu Trp Ala Glu Ser Lys Gly Phe Leu Tyr Phe Glu Thr Ser Ala
155 160 165

Gln Thr Gly Glu Gly Ile Asn Glu Met Phe Gln Ile His Leu Gly
170 175 180



<210> 15 
<211> 374 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 2254315CD1 

<400> 15 

Met Ala Ala Ser Ala Ala Ala Ala Glu Leu Gln Ala Ser Gly Gly
1 5 10 15

Pro Arg His Pro Val Cys Leu Leu Val Leu Gly Met Ala Gly Ser
20 25 30

Gly Lys Thr Thr Phe Val Gln Arg Leu Thr Gly His Leu His Ala
35 40 45

Gln Gly Thr Pro Pro Tyr Val Ile Asn Leu Asp Pro Ala Val His
50 55 60

Glu Val Pro Phe Pro Ala Asn Ile Asp Ile Arg Asp Thr Val Lys
65 70 75

Tyr Lys Glu Val Met Lys Gln Tyr Gly Leu Gly Pro Asn Gly Gly
80 85 90

Ile Val Thr Ser Leu Asn Leu Phe Ala Thr Arg Phe Asp Gln Val
95 100 105

Met Lys Phe Ile Glu Lys Ala Gln Asn Met Ser Lys Tyr Val Leu
110 115 120

Ile Asp Thr Pro Gly Gln Ile Glu Val Phe Thr Trp Ser Ala Ser
125 130 135

Gly Thr Ile Ile Thr Glu Ala Leu Ala Ser Ser Phe Pro Thr Val
140 145 150

Val Ile Tyr Val Met Asp Thr Ser Arg Ser Thr Asn Pro Val Thr
155 160 165

Phe Met Ser Asn Met Leu Tyr Ala Cys Ser Ile Leu Tyr Lys Thr
170 175 180

Lys Leu Pro Phe Ile Val Val Met Asn Lys Thr Asp Ile Ile Asp
185 190 195

His Ser Phe Ala Val Glu Trp Met Gln Asp Phe Glu Ala Phe Gln
200 205 210

Asp Ala Leu Asn Gln Glu Thr Thr Tyr Val Ser Asn Leu Thr Arg
215 220 225

Ser Met Ser Leu Val Leu Asp Glu Phe Tyr Ser Ser Leu Arg Val
230 235 240

Val Gly Val Ser Ala Val Leu Gly Thr Gly Leu Asp Glu Leu Phe
245 250 255

Val Gln Val Thr Ser Ala Ala Glu Glu Tyr Glu Arg Glu Tyr Arg
260 265 270

Pro Glu Tyr Glu Arg Leu Lys Lys Ser Leu Ala Asn Ala Glu Ser
275 280 285

Gln Gln Gln Arg Glu Gln Leu Glu Arg Leu Arg Lys Asp Met Gly
290 295 300

Ser Val Ala Leu Asp Ala Gly Thr Ala Lys Asp Ser Leu Ser Pro
305 310 315

Val Leu His Pro Ser Asp Leu Ile Leu Thr Arg Gly Thr Leu Asp
320 325 330

Glu Glu Asp Glu Glu Ala Asp Ser Asp Thr Asp Asp Ile Asp His
335 340 345

Arg Val Thr Glu Glu Ser His Glu Glu Pro Ala Phe Gln Asn Phe
350 355 360

Met Gln Glu Ser Met Ala Gln Tyr Trp Lys Arg Asn Asn Lys
365 370



<210> 16 

<211> 649 

<212> PRT 

<213> Homo sapiens 
<220> 






<221> misc_feature

<223> Incyte ID No: 2415545CD1

<400> 16 

Met Glu Gly Ala Gly Ta r Arg Val Val Phe Glu Lys Gly Gly Val 

1 5 10 15

Tyr Leu His Thr Ser Ala Lys Lys Tyr Gln Asp Arg Asp Ser Leu
20 25 30

Ile Ala Gly Val Ile Arg Val Val Glu Lys Asp Asn Asp Val Leu
35 40 45

Leu His Trp Ala Pro Val Glu Glu Ala Gly Asp Ser Thr Gln Ile
50 55 60

Leu Phe Ser Lys Lys Asp Ser Ser Gly Gly Asp Ser Cys Ala Ser
65 70 75

Glu Glu Glu Pro Thr Phe Asp Pro Gly Tyr Glu Pro Asp Trp Ala
80 85 90

Val Ile Ser Thr Val Arg Pro Gln Pro Cys His Ser Glu Pro Thr
95 100 105

Arg Gly Ala Glu Pro Ser Cys Pro Gln Gly Ser Trp Ala Phe Ser
110 115 120

Val Ser Leu Gly Glu Leu Lys Ser Ile Arg Arg Ser Lys Pro Gly
125 130 135

Leu Ser Trp Ala Tyr Leu Val Leu Val Thr Gln Ala Gly Gly Ser
140 145 150

Leu Pro Ala Leu His Phe His Arg Gly Gly Thr Arg Ala Leu Leu
155 160 165

Arg Val Leu Ser Arg Tyr Leu Leu Leu Ala Ser Ser Pro Gln Asp
170 175 180

Ser Arg Leu Tyr Leu Val Phe Pro His Asp Ser Ser Ala Leu Ser
185 190 195

Asn Ser Phe His His Leu Gln Leu Phe Asp Gln Asp Ser Ser Asn
200 205 210

Val Val Ser Arg Phe Leu Gln Asp Pro Tyr Ser Thr Thr Phe Ser
215 220 225

Ser Phe Ser Arg Val Thr Asn Phe Phe Arg Gly Ala Leu Gln Pro
230 235 240

Gln Pro Glu Gly Ala Ala Ser Asp Leu Pro Pro Pro Pro Asp Asp
245 250 255

Glu Pro Glu Pro Gly Phe Glu Val Ile Ser Cys Val Glu Leu Gly
260 265 270

Pro Arg Pro Thr Val Glu Arg Gly Pro Pro Val Thr Glu Glu Glu
275 280 285

Trp Ala Arg His Val Gly Pro Glu Gly Arg Leu Gln Gln Val Pro
290 295 300

Glu Leu Lys Asn Arg Ile Phe Ser Gly Gly Leu Ser Pro Ser Leu
305 310 315

Arg Arg Glu Ala Trp Lys Phe Leu Leu Gly Tyr Leu Ser Trp Glu
320 325 330

Gly Thr Ala Glu Glu His Lys Ala His Ile Arg Lys Lys Thr Asp
335 340 345

Glu Tyr Phe Arg Met Lys Leu Gln Trp Lys Ser Val Ser Pro Glu
350 355 360

Gln Glu Arg Arg Asn Ser Leu Leu His Gly Tyr Arg Ser Leu Ile
365 370 375

Glu Arg Asp Val Ser Arg Thr Asp Arg Thr Asn Lys Phe Tyr Glu
380 385 390

Gly Pro Glu Asn Pro Gly Leu Gly Leu Leu Asn Asp Ile Leu Leu
395 400 405

Thr Tyr Cys Met Tyr His Phe Asp Leu Gly Tyr Val Gln Gly Met
410 415 420

Ser Asp Leu Leu Ser Pro Ile Leu Tyr Val Ile Gln Asn Glu Val
425 430 435

Asp Ala Phe Trp Cys Phe Cys Gly Phe Met Glu Leu Val Gln Gly
440 445 450

Asn Phe Glu Glu Ser Gln Glu Thr Met Lys Arg Gln Leu Gly Arg
455 460 465

Leu Leu Leu Leu Leu Arg Val Leu Asp Pro Leu Leu Cys Asp Phe
470 475 480

Leu Asp Ser Gln Asp Ser Gly Ser Leu Cys Phe Cys Phe Arg Trp
485 490 495

Leu Leu Ile Trp Phe Lys Arg Glu Phe Pro Phe Pro Asp Val Leu
500 505 510

Arg Leu Trp Glu Val Leu Trp Thr Gly Leu Pro Gly Pro Asn Leu
515 520 525

His Leu Leu Val Ala Cys Ala Ile Leu Asp Met Glu Arg Asp Thr
530 535 540

Leu Met Leu Ser Gly Phe Gly Ser Asn Glu Ile Leu Lys His Ile
545 550 555

Asn Glu Leu Thr Met Lys Leu Ser Val Glu Asp Val Leu Thr Arg
560 565 570

Ala Glu Ala Leu His Arg Gln Leu Thr Ala Cys Thr Arg Ala Ala
575 580 585

Pro Gln Arg Ala Gly Asp Pro Gly Ala Gly Pro Ala Thr Gln Ser
590 595 600

Pro Thr Ala Pro Arg Pro Pro Pro Pro Arg Cys Leu Cys Thr Pro
605 610 615

Thr Arg Ala Pro Pro Thr Pro Pro Pro Ser Thr Asp Thr Ala Pro
620 625 630

Gln Pro Asp Ser Ser Leu Glu Ile Leu Pro Glu Glu Glu Asp Glu
635 640 645

Gly Ala Asp Ser



<210> 17 
<211> 698 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 2707969CD1 

<400> 17 

Met Cys His Asp Asp Asp Asp Lys Asp Pro Val Leu Arg Val Phe
1 5 10 15

Asp Ser Arg Val Asp Lys Ile Arg Leu Leu Asn Val Arg Thr Pro
20 25 30

Thr Leu Arg Thr Ser Met Tyr Gln Lys Cys Thr Thr Val Asp Glu
35 40 45

Ala Glu Lys Ala Ile Glu Leu Arg Leu Ala Lys Ile Asp His Thr
50 55 60

Ala Ile His Pro His Leu Leu Asp Met Lys Ile Gly Gln Gly Lys
65 70 75

Tyr Glu Pro Gly Phe Phe Pro Lys Leu Gln Ser Asp Val Leu Ser
80 85 90

Thr Gly Pro Ala Ser Asn Lys Trp Thr Lys Arg Asn Ala Pro Ala
95 100 105

Gln Trp Arg Arg Lys Asp Arg Gln Lys Gln His Thr Glu His Leu
110 115 120

Arg Leu Asp Asn Asp Gln Arg Glu Lys Tyr Ile Gln Glu Ala Arg
125 130 135

Thr Met Gly Ser Thr Ile Arg Gln Pro Lys Leu Ser Asn Leu Ser
140 145 150

Pro Ser Val Ile Ala Gln Thr Asn Trp Lys Phe Val Glu Gly Leu
155 160 165

Leu Lys Glu Cys Arg Asn Lys Thr Lys Arg Met Leu Val Glu Lys
170 175 180

Met Gly Arg Glu Ala Val Glu Leu Gly His Gly Glu Val Asn Ile
185 190 195

Thr Gly Val Glu Glu Asn Thr Leu Ile Ala Ser Leu Cys Asp Leu
200 205 210

Leu Glu Arg Ile Trp Ser His Gly Leu Gln Val Lys Gln Gly Lys
215 220 225

Ser Ala Leu Trp Ser His Leu Leu His Tyr Gln Asp Asn Arg Gln
230 235 240

Arg Lys Leu Thr Ser Gly Ser Leu Ser Thr Ser Gly Ile Leu Leu
245 250 255

Asp Ser Glu Arg Arg Lys Ser Asp Ala Ser Ser Leu Met Pro Pro
260 265 270

Leu Arg Ile Ser Leu Ile Gln Asp Met Arg His Ile Gln Asn Ile
275 280 285

Gly Glu Ile Lys Thr Asp Val Gly Lys Ala Arg Ala Trp Val Arg
290 295 300

Leu Ser Met Glu Lys Lys Leu Leu Ser Arg His Leu Lys Gln Leu
305 310 315

Leu Ser Asp His Glu Leu Thr Lys Lys Leu Tyr Lys Arg Tyr Ala
320 325 330

Phe Leu Arg Cys Asp Asp Glu Lys Glu Gln Phe Leu Tyr His Leu
335 340 345

Leu Ser Phe Asn Ala Val Asp Tyr Phe Cys Phe Thr Asn Val Phe
350 355 360

Thr Thr Ile Leu Ile Pro Tyr His Ile Leu Ile Val Pro Ser Lys
365 370 375

Lys Leu Gly Gly Ser Met Phe Thr Ala Asn Pro Trp Ile Cys Ile
380 385 390

Ser Gly Glu Leu Gly Glu Thr Gln Ile Met Gln Ile Pro Arg Asn
395 400 405

Val Leu Glu Met Thr Phe Glu Cys Gln Asn Leu Gly Lys Leu Thr
410 415 420

Thr Val Gln Ile Gly His Asp Asn Ser Gly Leu Tyr Ala Lys Trp
425 430 435

Leu Val Glu Tyr Val Met Val Arg Asn Glu Ile Thr Gly His Thr
440 445 450

Tyr Lys Phe Pro Cys Gly Arg Trp Leu Gly Lys Gly Met Asp Asp
455 460 465

Gly Ser Leu Glu Arg Ile Leu Val Gly Glu Leu Leu Thr Ser Gln
470 475 480

Pro Glu Val Asp Glu Arg Pro Cys Arg Thr Pro Pro Leu Gln Gln
485 490 495

Ser Pro Ser Val Ile Arg Arg Leu Val Thr Ile Ser Pro Asn Asn
500 505 510

Lys Pro Lys Leu Asn Thr Gly Gln Ile Gln Glu Ser Ile Gly Glu
515 520 525

Ala Val Asn Gly Ile Val Lys His Phe His Lys Pro Glu Lys Glu
530 535 540

Arg Gly Ser Leu Thr Leu Leu Leu Cys Gly Glu Cys Gly Leu Val
545 550 555

Ser Ala Leu Glu Gln Ala Phe Gln His Gly Phe Lys Ser Pro Arg
560 565 570

Leu Phe Lys Asn Val Phe Ile Trp Asp Phe Leu Glu Lys Ala Gln
575 580 585

Thr Tyr Tyr Glu Thr Leu Glu Lys Asn Glu Val Val Pro Glu Glu
590 595 600

Asn Trp His Thr Arg Ala Arg Asn Phe Cys Arg Phe Val Thr Ala
605 610 615

Ile Asn Asn Thr Pro Arg Asn Ile Gly Lys Asp Gly Lys Phe Gln
620 625 630

Met Leu Val Cys Leu Gly Ala Arg Asp His Leu Leu His His Trp
635 640 645

Ile Ala Leu Leu Ala Asp Cys Pro Ile Thr Ala His Met Tyr Glu
650 655 660

Asp Val Ala Leu Ile Lys Asp His Thr Leu Val Asn Ser Leu Ile
665 670 675

Arg Val Leu Gln Thr Leu Gln Glu Phe Asn Ile Thr Leu Glu Thr
680 685 690

Ser Leu Val Lys Gly Ile Asp Ile
695

<210> 18 
<211> 396 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 2817769CD1 

<400> 18 

Met Pro Pro Lys Lys Gly Gly Asp Gly Ile Lys Pro Pro Pro Ile

1 5 10 15

Ile Gly Arg Phe Gly Thr Ser Leu Lys Ile Gly Ile Val Gly Leu

20 25 30

Pro Asn Val Gly Lys Ser Thr Phe Phe Asn Val Leu Thr Asn Ser

35 40 45

Gln Ala Ser Ala Glu Asn Phe Pro Phe Cys Thr Ile Asp Pro Asn

50 55 60

Glu Ser Arg Val Pro Val Pro Asp Glu Arg Phe Asp Phe Leu Cys

65 70 75

Gln Tyr His Lys Pro Ala Ser Lys Ile Pro Ala Phe Leu Asn Val

80 85 90

Val Asp Ile Ala Gly Leu Val Lys Gly Ala His Asn Gly Gln Gly

95 100 105

Leu Gly Asn Ala Phe Leu Ser His Ile Ser Ala Cys Asp Gly Ile

110 115 120

Phe His Leu Thr Arg Ala Phe Glu Asp Asp Asp Ile Thr His Val

125 130 135

Glu Gly Ser Val Asp Pro Ile Arg Asp Ile Glu Ile Ile His Glu

140 145 150

Glu Leu Gln Leu Lys Asp Glu Glu Met Ile Gly Pro Ile Ile Asp

155 160 165

Lys Leu Glu Lys Val Ala Val Arg Gly Gly Asp Lys Lys Leu Lys

170 175 180

Pro Glu Tyr Asp Ile Met Cys Lys Val Lys Ser Trp Val Ile Asp

185 190 195

Gln Lys Lys Pro Val Arg Phe Tyr His Asp Trp Asn Asp Lys Glu

200 205 210

Ile Glu Val Leu Asn Lys His Leu Phe Leu Thr Ser Lys Pro Met

215 220 225

Val Tyr Leu Val Asn Leu Ser Glu Lys Asp Tyr Ile Arg Lys Lys

230 235 240

Asn Lys Trp Leu Ile Lys Ile Lys Glu Trp Val Asp Lys Tyr Asp

245 250 255

Pro Gly Ala Leu Val Ile Pro Phe Ser Gly Ala Leu Glu Leu Lys

260 265 270

Leu Gln Glu Leu Ser Ala Glu Glu Arg Gln Lys Tyr Leu Glu Ala

275 280 285

Asn Met Thr Gln Ser Ala Leu Pro Lys Ile Ile Lys Ala Gly Phe

290 295 300

Ala Ala Leu Gln Leu Glu Tyr Phe Phe Thr Ala Gly Pro Asp Glu

305 310 315

Val Arg Ala Trp Thr Ile Arg Lys Gly Thr Lys Ala Pro Gln Ala

320 325 330

Ala Gly Lys Ile His Thr Asp Phe Glu Lys Gly Phe Ile Met Ala

335 340 345

Glu Val Met Lys Tyr Glu Asp Phe Lys Glu Glu Gly Ser Glu Asn

350 355 360

Ala Val Lys Ala Ala Gly Lys Tyr Arg Gln Gln Gly Arg Asn Tyr

365 370 375

Ile Val Glu Asp Gly Asp Ile Ile Phe Phe Lys Phe Asn Thr Pro

380 385 390

Gln Gln Pro Lys Lys Lys

395



<210> 19 
<211> 634 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2917557CD1 

<400> 19 

Met Ser Ser Asp Ser Glu Tyr Asp Ser Asp Asp Asp Arg Thr Lys

1 5 10 15

Glu Glu Arg Ala Tyr Asp Lys Ala Lys Arg Arg Ile Glu Lys Arg

20 25 30

Arg Leu Glu His Ser Lys Asn Val Asn Thr Glu Lys Leu Arg Ala

35 40 45

Pro Ile Ile Cys Val Leu Gly His Val Asp Thr Gly Lys Thr Lys

50 55 60

Ile Leu Asp Lys Leu Arg His Thr His Val Gln Asp Gly Glu Ala

65 70 75

Gly Gly Ile Thr Gln Gln Ile Gly Ala Thr Asn Val Pro Leu Glu

80 85 90

Ala Ile Asn Glu Gln Thr Lys Met Ile Lys Asn Phe Asp Arg Glu

95 100 105

Asn Val Arg Ile Pro Gly Met Leu Ile Ile Asp Thr Pro Gly His

110 115 120

Glu Ser Phe Ser Asn Leu Arg Asn Arg Gly Ser Ser Leu Cys Asp

125 130 135

Ile Ala Ile Leu Val Val Asp Ile Met His Gly Leu Glu Pro Gln

140 145 150

Thr Ile Glu Ser Ile Asn Leu Leu Lys Ser Lys Lys Cys Pro Phe

155 160 165

Ile Val Ala Leu Asn Lys Ile Asp Arg Leu Tyr Asp Trp Lys Lys

170 175 180

Ser Pro Asp Ser Asp Val Ala Ala Thr Leu Lys Lys Gln Lys Lys

185 190 195

Asn Thr Lys Asp Glu Phe Glu Glu Arg Ala Lys Ala Ile Ile Val

200 205 210

Glu Phe Ala Gln Gln Gly Leu Asn Ala Ala Leu Phe Tyr Glu Asn

215 220 225

Lys Asp Pro Arg Thr Phe Val Ser Leu Val Pro Thr Ser Ala His

230 235 240

Thr Gly Asp Gly Met Gly Ser Leu Ile Tyr Leu Leu Val Glu Leu

245 250 255

Thr Gln Thr Met Leu Ser Lys Arg Leu Ala His Cys Glu Glu Leu

260 265 270

Arg Ala Gln Val Met Glu Val Lys Ala Leu Pro Gly Met Gly Thr

275 280 285

Thr Ile Asp Val Ile Leu Ile Asn Gly Arg Leu Lys Glu Gly Asp

290 295 300

Thr Ile Ile Val Pro Gly Val Glu Gly Pro Ile Val Thr Gln Ile

305 310 315

Arg Gly Leu Leu Leu Pro Pro Pro Met Lys Glu Leu Arg Val Lys

320 325 330

Asn Gln Tyr Glu Lys His Lys Glu Val Glu Ala Ala Gln Gly Val

335 340 345

Lys Ile Leu Gly Lys Asp Leu Glu Lys Thr Leu Ala Gly Leu Pro

350 355 360

Leu Leu Val Ala Tyr Lys Glu Asp Glu Ile Pro Val Leu Lys Asp

365 370 375

Glu Leu Ile His Glu Leu Lys Gln Thr Leu Asn Ala Ile Lys Leu

380 385 390

Glu Glu Lys Gly Val Tyr Val Gln Ala Ser Thr Leu Gly Ser Leu

395 400 405 

Glu Ala Leu Leu Glu Phe Leu Lys Thr Ser Glu Val Pro Tyr Ala 

410 415 420

Gly Ile Asn Ile Gly Pro Val His Lys Lys Asp Val Met Lys Ala

425 430 435

Ser Val Met Leu Glu His Asp Pro Gln Tyr Ala Val Ile Leu Ala

440 445 450

Phe Asp Val Arg Ile Glu Arg Asp Ala Gln Glu Met Ala Asp Ser

455 460 465

Leu Gly Val Arg Ile Phe Ser Ala Glu Ile Ile Tyr His Leu Phe

470 475 480

Asp Ala Phe Thr Lys Tyr Arg Gln Asp Tyr Lys Lys Gln Lys Gln

485 490 495

Glu Glu Phe Lys His Ile Ala Val Phe Pro Cys Lys Ile Lys Ile

500 505 510

Leu Pro Gln Tyr Ile Phe Asn Ser Arg Asp Pro Ile Val Met Gly

515 520 525

Val Thr Val Glu Ala Gly Gln Val Lys Gln Gly Thr Pro Met Cys

530 535 540

Val Pro Ser Lys Asn Phe Val Asp Ile Gly Ile Val Thr Ser Ile

545 550 555

Glu Ile Asn His Lys Gln Val Asp Val Ala Lys Lys Gly Gln Glu

560 565 570

Val Cys Val Lys Ile Glu Pro Ile Pro Gly Glu Ser Pro Lys Met

575 580 585

Phe Gly Arg His Phe Glu Ala Thr Asp Ile Leu Val Ser Lys Ile

590 595 600

Ser Arg Gln Ser Ile Asp Ala Leu Lys Asp Trp Phe Arg Asp Glu

605 610 615

Met Gln Lys Ser Asp Trp Gln Leu Ile Val Glu Leu Lys Lys Val

620 625 630


<210> 20
<211> 196
<212> PRT

<213> Homo sapiens
<220>

<221> misc_feature

<223> Incyte ID No: 3421335CD1 

<400> 20 

Met Gly Ser Val Asn Ser Arg Gly His Lys Ala Glu Ala Gln Val

1 5 10 15

Val Met Met Gly Leu Asp Ser Ala Gly Lys Thr Thr Leu Leu Tyr

20 25 30

Lys Leu Lys Gly His Gln Leu Val Glu Thr Leu Pro Thr Val Gly

35 40 45

Phe Asn Val Glu Pro Leu Lys Ala Pro Gly His Val Ser Leu Thr

50 55 60

Leu Trp Asp Val Gly Gly Gln Ala Pro Leu Arg Ala Ser Trp Lys

65 70 75

Asp Tyr Leu Glu Gly Thr Asp Ile Leu Val Tyr Val Leu Asp Ser

80 85 90

Thr Asp Glu Ala Arg Leu Pro Glu Ser Ala Ala Glu Leu Thr Glu

95 100 105

Val Leu Asn Asp Pro Asn Met Ala Gly Val Pro Phe Leu Val Leu

110 115 120

Ala Asn Lys Gln Glu Ala Pro Asp Ala Leu Pro Leu Leu Lys Ile

125 130 135

Arg Asn Arg Leu Ser Leu Glu Arg Phe Gln Asp His Cys Trp Glu

140 145 150

Leu Arg Gly Cys Ser Ala Leu Thr Gly Glu Gly Leu Pro Glu Ala

155 160 165

Leu Gln Ser Leu Trp Ser Leu Leu Lys Ser Arg Ser Cys Met Cys

170 175 180

Leu Gln Ala Arg Ala His Gly Ala Glu Arg Gly Asp Ser Lys Arg

185 190 195

Ser



<210> 21
<211> 446
<212> PRT

<213> Homo sapiens
<220>

<221> misc_feature

<223> Incyte ID No: 605761CD1 

<400> 21 

Met Ala Ala Arg Lys Gly Arg Arg Arg Thr Cys Glu Thr Gly Glu

1 5 10 15

Pro Met Glu Ala Glu Ser Gly Asp Thr Ser Ser Glu Gly Pro Ala

20 25 30

Gln Val Tyr Leu Pro Gly Arg Gly Pro Pro Leu Arg Glu Gly Glu

35 40 45

Glu Leu Val Met Asp Glu Glu Ala Tyr Val Leu Tyr His Arg Ala

50 55 60

Gln Thr Gly Ala Pro Cys Leu Ser Phe Asp Ile Val Arg Asp His

65 70 75

Leu Gly Asp Asn Arg Thr Glu Leu Pro Leu Thr Leu Tyr Leu Cys

80 85 90

Ala Gly Thr Gln Ala Glu Ser Ala Gln Ser Asn Arg Leu Met Met

95 100 105

Leu Arg Met His Asn Leu His Gly Thr Lys Pro Pro Pro Ser Glu

110 115 120

Gly Ser Asp Glu Glu Glu Glu Glu Glu Asp Glu Glu Asp Glu Glu

125 130 135

Glu Arg Lys Pro Gln Leu Glu Leu Ala Met Val Pro His Tyr Gly

140 145 150

Gly Ile Asn Arg Val Arg Val Ser Trp Leu Gly Glu Glu Pro Val

155 160 165

Ala Gly Val Trp Ser Glu Lys Gly Gln Val Glu Val Phe Ala Leu

170 175 180

Arg Arg Leu Leu Gln Val Val Glu Glu Pro Gln Ala Leu Ala Ala

185 190 195

Phe Leu Arg Asp Glu Gln Ala Gln Met Lys Pro Ile Phe Ser Phe

200 205 210

Ala Gly His Met Gly Glu Gly Phe Ala Leu Asp Trp Ser Pro Arg

215 220 225

Val Thr Gly Arg Leu Leu Thr Gly Asp Cys Gln Lys Asn Ile His

230 235 240

Leu Trp Thr Pro Thr Asp Gly Gly Ser Trp His Val Asp Gln Arg

245 250 255

Pro Phe Val Gly His Thr Arg Ser Val Glu Asp Leu Gln Trp Ser

260 265 270

Pro Thr Glu Asn Thr Val Phe Ala Ser Cys Ser Ala Asp Ala Ser

275 280 285

Ile Arg Ile Trp Asp Ile Arg Ala Ala Pro Ser Lys Ala Cys Met

290 295 300

Leu Thr Thr Ala Thr Ala His Asp Gly Asp Val Asn Val Ile Ser

305 310 315

Trp Ser Arg Arg Glu Pro Phe Leu Leu Ser Gly Gly Asp Asp Gly

320 325 330

Ala Leu Lys Ile Trp Asp Leu Arg Gln Phe Lys Ser Gly Ser Pro

335 340 345

Val Ala Thr Phe Lys Gln His Val Ala Pro Val Thr Ser Val Glu

350 355 360

Trp His Pro Gln Asp Ser Gly Val Phe Ala Ala Ser Gly Ala Asp

365 370 375

His Gln Ile Thr Gln Trp Asp Leu Ala Val Glu Arg Asp Pro Glu

380 385 390

Ala Gly Asp Val Glu Ala Asp Pro Gly Leu Ala Asp Leu Pro Gln

395 400 405

Gln Leu Leu Phe Val His Gln Gly Glu Thr Glu Leu Lys Glu Leu

410 415 420

His Trp His Pro Gln Cys Pro Gly Leu Leu Val Ser Thr Ala Leu

425 430 435

Ser Gly Phe Thr Ile Phe Arg Thr Ile Ser Val

440 445



<210> 22 

<211> 265 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 483862CD1 

<400> 22 

Met Ser Ser Gly Leu Arg Ala Ala Asp Phe Pro Arg Trp Lys Arg

1 5 10 15

His Ile Ser Glu Gln Leu Arg Arg Arg Asp Arg Leu Gln Arg Gln

20 25 30

Ala Phe Glu Glu Ile Ile Leu Gln Tyr Asn Lys Leu Leu Glu Lys

35 40 45

Ser Asp Leu His Ser Val Leu Ala Gln Lys Leu Gln Ala Glu Lys

50 55 60

His Asp Val Pro Asn Arg His Glu Ile Ser Pro Gly His Asp Gly

65 70 75

Thr Trp Asn Asp Asn Gln Leu Gln Glu Met Ala Gln Leu Arg Ile

80 85 90

Lys His Gln Glu Glu Leu Thr Glu Leu His Lys Lys Arg Gly Glu

95 100 105

Leu Ala Gln Leu Val Ile Asp Leu Asn Asn Gln Met Gln Arg Lys

110 115 120

Asp Arg Glu Met Gln Met Asn Glu Ala Lys Ile Ala Glu Cys Leu

125 130 135

Gln Thr Ile Ser Asp Leu Glu Thr Glu Cys Leu Asp Leu Arg Thr

140 145 150

Lys Leu Cys Asp Leu Glu Arg Ala Asn Gln Thr Leu Lys Asp Glu

155 160 165

Tyr Asp Ala Leu Gln Ile Thr Phe Thr Ala Leu Glu Gly Lys Leu

170 175 180

Arg Lys Thr Thr Glu Glu Asn Gln Glu Leu Val Thr Arg Trp Met

185 190 195

Ala Glu Lys Ala Gln Glu Ala Asn Arg Leu Asn Ala Glu Asn Glu

200 205 210

Lys Asp Ser Arg Arg Arg Gln Ala Arg Leu Gln Lys Glu Leu Ala

215 220 225

Glu Ala Ala Lys Glu Pro Leu Pro Val Glu Gln Asp Asp Asp Ile

230 235 240

Glu Val Ile Val Asp Glu Thr Ser Asp His Thr Glu Glu Thr Ser

245 250 255

Pro Val Arg Ala Ile Ser Arg Ala Ala Thr

260 265



<210> 23

<211> 185

<212> PRT

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 1256777CD1 

<400> 23 

Met Leu Lys Ala Lys Ile Leu Phe Val Gly Pro Cys Glu Ser Gly

1 5 10 15

Lys Thr Val Leu Ala Asn Phe Leu Thr Glu Ser Ser Asp Ile Thr

20 25 30

Glu Tyr Ser Pro Thr Gln Gly Val Arg Ile Leu Glu Phe Glu Asn

35 40 45

Pro His Val Thr Ser Asn Asn Lys Gly Thr Gly Cys Glu Phe Glu

50 55 60

Leu Trp Asp Cys Gly Gly Asp Ala Lys Phe Glu Ser Cys Trp Pro

65 70 75

Ala Leu Met Lys Asp Ala His Gly Val Val Ile Val Phe Asn Ala

80 85 90

Asp Ile Pro Ser His Arg Lys Glu Met Glu Met Trp Tyr Ser Cys

95 100 105

Phe Val Gln Gln Pro Ser Leu Gln Asp Thr Gln Cys Met Leu Ile

110 115 120

Ala His His Lys Pro Gly Ser Gly Asp Asp Lys Gly Ser Leu Ser

125 130 135

Leu Ser Pro Pro Leu Asn Lys Leu Lys Leu Val His Ser Asn Leu

140 145 150






Glu Asp Asp Pro Glu Glu Ile Arg Met Glu Phe Ile Lys Tyr Leu

155 160 165

Lys Ser Ile Ile Asn Ser Met Ser Glu Ser Arg Asp Arg Glu Glu

170 175 180

Met Ser Ile Met Thr

185

<210> 24

<211> 554

<212> PRT

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 2198779CD1

<400> 24 

Met Gly Ser Arg Asn Ser Ser Ser Ala Gly Ser Gly Ser Gly Asp

1 5 10 15

Pro Ser Glu Gly Leu Pro Arg Arg Gly Ala Gly Leu Arg Arg Ser

20 25 30

Glu Glu Glu Glu Glu Glu Asp Glu Asp Val Asp Leu Ala Gln Val

35 40 45

Leu Ala Tyr Leu Leu Arg Arg Gly Gln Val Arg Leu Val Gln Gly

50 55 60

Gly Gly Ala Ala Asn Leu Gln Phe Ile Gln Ala Leu Leu Asp Ser

65 70 75

Glu Glu Glu Asn Asp Arg Ala Trp Asp Gly Arg Leu Gly Asp Arg

80 85 90

Tyr Asn Pro Pro Val Asp Ala Thr Pro Asp Thr Arg Glu Leu Glu

95 100 105

Phe Asn Glu Ile Lys Thr Gln Val Glu Leu Ala Thr Gly Gln Leu

110 115 120

Gly Leu Arg Arg Ala Ala Gln Lys His Ser Phe Pro Arg Met Leu

125 130 135

His Gln Arg Glu Arg Gly Leu Cys His Arg Gly Ser Phe Ser Leu

140 145 150

Gly Glu Gln Ser Arg Val Ile Ser His Phe Leu Pro Asn Asp Leu

155 160 165

Gly Phe Thr Asp Ser Tyr Ser Gln Lys Ala Phe Cys Gly Ile Tyr

170 175 180

Ser Lys Asp Gly Gln Ile Phe Met Ser Ala Cys Gln Asp Gln Thr

185 190 195

Ile Arg Leu Tyr Asp Cys Arg Tyr Gly Arg Phe Arg Lys Phe Lys

200 205 210

Ser Ile Lys Ala Arg Asp Val Gly Trp Ser Val Leu Asp Val Ala

215 220 225

Phe Thr Pro Asp Gly Asn His Phe Leu Tyr Ser Ser Trp Ser Asp

230 235 240

Tyr Ile His Ile Cys Asn Ile Tyr Gly Glu Gly Asp Thr His Thr

245 250 255

Ala Leu Asp Leu Arg Pro Asp Glu Arg Arg Phe Ala Val Phe Ser

260 265 270

Ile Ala Val Ser Ser Asp Gly Arg Glu Val Leu Gly Gly Ala Asn

275 280 285

Asp Gly Cys Leu Tyr Val Phe Asp Arg Glu Gln Asn Arg Arg Thr

290 295 300

Leu Gln Ile Glu Ser His Glu Asp Asp Val Asn Ala Val Ala Phe

305 310 315

Ala Asp Ile Ser Ser Gln Ile Leu Phe Ser Gly Gly Asp Asp Ala

320 325 330

Ile Cys Lys Val Trp Asp Arg Arg Thr Met Arg Glu Asp Asp Pro

335 340 345

Lys Pro Val Gly Ala Leu Ala Gly His Gln Asp Gly Ile Thr Phe

350 355 360

Ile Asp Ser Lys Gly Asp Ala Arg Tyr Leu Ile Ser Asn Ser Lys

365 370 375

Asp Gln Thr Ile Lys Leu Trp Asp Ile Arg Arg Phe Ser Ser Arg

380 385 390

Glu Gly Met Glu Ala Ser Arg Gln Ala Ala Thr Gln Gln Asn Trp

395 400 405

Asp Tyr Arg Trp Gln Gln Val Pro Lys Lys Gly Phe Thr Leu His

410 415 420

Pro Tyr Pro Ala Trp Arg Lys Leu Lys Leu Pro Gly Asp Ser Ser

425 430 435

Leu Met Thr Tyr Arg Gly His Gly Val Leu His Thr Leu Ile Arg

440 445 450

Cys Arg Phe Ser Pro Ile His Ser Thr Gly Gln Gln Phe Ile Tyr

455 460 465

Ser Gly Cys Ser Thr Gly Lys Val Val Val Tyr Asp Leu Leu Ser

470 475 480

Gly His Ile Val Lys Lys Leu Thr Asn His Lys Ala Cys Val Arg

485 490 495

Asp Val Ser Trp His Pro Phe Glu Glu Lys Ile Val Ser Ser Ser

500 505 510

Trp Asp Gly Asn Leu Arg Leu Trp Gln Tyr Arg Gln Ala Glu Tyr

515 520 525

Phe Gln Asp Asp Met Pro Glu Ser Glu Glu Cys Ala Ser Ala Pro

530 535 540

Ala Pro Val Pro Gln Ser Ser Thr Pro Phe Ser Ser Pro Gln

545 550



<210> 25 

<211> 434 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2226116CD1 

<400> 25 

Met Arg Pro Ser Ser Ser Val Ser Val Ser Cys Pro Ala Leu Asn

1 5 10 15

Gln Val Ser His Phe Ala Asn Leu Thr Ser Val Gly Ala Met Ala

20 25 30

Pro Ala Arg Cys Phe Ser Ala Arg Leu Arg Thr Val Phe Gln Gly

35 40 45

Val Gly His Trp Ala Leu Ser Thr Trp Ala Gly Leu Lys Pro Ser

50 55 60

Arg Leu Leu Pro Gln Arg Ala Ser Pro Arg Leu Leu Ser Val Gly

65 70 75

Arg Ala Asp Leu Ala Lys His Gln Glu Leu Pro Gly Lys Lys Leu

80 85 90

Leu Ser Glu ... Lys ... Arg Tyr Phe Val Asp Tyr Arg Arg

95 100 105

Val Leu Val Cys Gly Gly Asn Gly Gly Ala Gly Ala Ser Cys Phe

110 115 120

His Ser Glu Pro Arg Lys Glu Phe Gly Gly Pro Asp Gl Gly Asp

125 130 135

Gly Gly Asn Gly Gly His Val Ile Leu Arg Val Asp Gln Gln Val

140 145 150

Lys Ser Leu Ser Ser Val Leu Ser Arg Tyr Gln Gly Phe Ser Gly

155 160 165

Glu Asp Gly Gly Ser Lys Asn Cys Phe Gly Arg Ser Gly Ala Val

170 175 180

Leu Tyr Ile Arg Val Pro Val Gly Thr Leu Val Lys Glu Gly Gly

185 190 195

Arg Val Val Ala Asp Leu Ser Cys Val Gly Asp Glu Tyr Ile Ala

200 205 210

Ala Leu Gly Gly Ala Gly Gly Lys Gly Asn Arg Phe Phe Leu Ala

215 220 225

Asn Asn Asn Arg Ala Pro Val Thr Cys Thr Pro Gly Gln Pro Gly

230 235 240

Gln Gln Arg Val Leu His Leu Glu Leu Lys Thr Val Ala His Ala

245 250 255

Gly Met Val Gly Phe Pro Asn Ala Gly Lys Ser Ser Leu Leu Arg

260 265 270

Ala Ile Ser Asn Ala Arg Pro Ala Val Ala Ser Tyr Pro Phe Thr

275 280 285

Thr Leu Lys Pro His Val Gly Ile Val His Tyr Glu Gly His Leu

290 295 300

Gln Ile Ala Val Ala Asp Ile Pro Gly Ile Ile Arg Gly Ala His

305 310 315

Gln Asn Arg Gly Leu Gly Ser Ala Phe Leu Arg His Ile Glu Arg

320 325 330

Cys Arg Phe Leu Leu Phe Val Val Asp Leu Ser Gln Pro Glu Pro

335 340 345

Trp Thr Gln Val Asp Asp Leu Lys Tyr Glu Leu Gl ... Tyr Glu

350 355 360

Lys Gly Leu Ser Ala ... Pro His Ala Ile Val Ala Asn Lys Ile

365 370 375

Asp Leu Pro Glu Ala Gln Ala Asn Leu Ser Gln Leu Arg Asp His

380 385 390

... Gly Gln Glu Val Ile Val Leu Ser Ala Leu Thr Gly Glu Asn

395 400 405

Leu Glu Gln Leu ... Leu His ... Val ... Tyr Asp Ala Tyr

410 415 420

Ala Glu Ala Glu Leu Gly Gln Gly Arg Gln Pro Leu Arg Trp

425 430


<210> 26
<211> 826
<212> PRT

<213> Homo sapiens
<220>

<221> misc_feature

<223> Incyte ID No: 2504472CD1 

<400> 26 

Met Val Ala Pro Val Leu Glu Thr Ser His Val Phe Cys Cys Pro

1 5 10 15

Asn Arg Val Arg Gly Val Leu Asn Trp Ser Ser Gly Pro Arg Gly

20 25 30

Leu Leu Ala Phe Gly Thr Ser Cys Ser Val Val Leu Tyr Asp Pro

35 40 45

Leu Lys Arg Val Val Val Thr Asn Leu Asn Gly His Thr Ala Arg

50 55 60

Val Asn Cys Ile Gln Trp Ile Cys Lys Gln Asp Gly Ser Pro Ser

65 70 75

Thr Glu Leu Val Ser Gly Gly Ser Asp Asn Gln Val Ile His Trp

80 85 90

Glu Ile Glu Asp Asn Gln Leu Leu Lys Ala Val His Leu Gln Gly

95 100 105

His Glu Gly Pro Val Tyr Ala Val His Ala Val Tyr Gln Arg Arg

110 115 120

Thr Ser Asp Pro Ala Leu Cys Thr Leu Ile Val Ser Ala Ala Ala

125 130 135

Asp Ser Ala Val Arg Leu Trp Ser Lys Lys Gly Pro Glu Val Met

140 145 150

Cys Leu Gln Thr Leu Asn Phe Gly Asn Gly Phe Ala Leu Ala Leu

155 160 165

Cys Leu Ser Phe Leu Pro Asn Thr Asp Val Pro Ile Leu Ala Cys

170 175 180

Gly Asn Asp Asp Cys Arg Ile His Ile Phe Ala Gln Gln Asn Asp

185 190 195

Gln Phe Gln Lys Val Leu Ser Leu Cys Gly His Glu Asp Trp Ile

200 205 210

Arg Gly Val Glu Trp Ala Ala Phe Gly Arg Asp Leu Phe Leu Ala

215 220 225

Ser Cys Ser Gln Asp Cys Leu Ile Arg Ile Trp Lys Leu Tyr Ile

230 235 240

Lys Ser Thr Ser Leu Glu Thr Gln Asp Asp Asp Asn Ile Arg Leu

245 250 255

Lys Glu Asn Thr Phe Thr Ile Glu Asn Glu Ser Val Lys Ile Ala

260 265 270

Phe Ala Val Thr Leu Glu Thr Val Leu Ala Gly His Glu Asn Trp

275 280 285

Val Asn Ala Val His Trp Gln Pro Val Phe Tyr Lys Asp Gly Val

290 295 300

Leu Gln Gln Pro Val Arg Leu Leu Ser Ala Ser Met Asp Lys Thr

305 310 315

Met Ile Leu Trp Ala Pro Asp Glu Glu Ser Gly Val Trp Leu Glu

320 325 330

Gln Val Arg Val Gly Glu Val Gly Gly Asn Thr Leu Gly Phe Tyr

335 340 345

Asp Cys Gln Phe Asn Glu Asp Gly Ser Met Ile Ile Ala His Ala

350 355 360

Phe His Gly Ala Leu His Leu Trp Lys Gln Asn Thr Val Asn Pro

365 370 375

Arg Glu Trp Thr Pro Glu Ile Val Ile Ser Gly His Phe Asp Gly

380 385 390

Val Gln Asp Leu Val Trp Asp Pro Glu Gly Glu Phe Ile Ile Thr

395 400 405

Val Gly Thr Asp Gln Thr Thr Arg Leu Phe Ala Pro Trp Lys Arg

410 415 420

Lys Asp Gln Ser Gln Val Thr Trp His Glu Ile Ala Arg Pro Gln

425 430 435

Ile His Gly Tyr Asp Leu Lys Cys Leu Ala Met Ile Asn Arg Phe

440 445 450

Gln Phe Val Ser Gly Ala Asp Glu Lys Val Leu Arg Val Phe Ser

455 460 465

Ala Pro Arg Asn Phe Val Glu Asn Phe Cys Ala Ile Thr Gly Gln

470 475 480

Ser Leu Asn His Val Leu Cys Asn Gln Asp Ser Asp Leu Pro Glu

485 490 495

Gly Ala Thr Val Pro Ala Leu Gly Leu Ser Asn Lys Ala Val Phe

500 505 510

Gln Gly Asp Ile Ala Ser Gln Pro Ser Asp Glu Glu Glu Leu Leu

515 520 525

Thr Ser Thr Gly Phe Glu Tyr Gln Gln Val Ala Phe Gln Pro Ser

530 535 540

Ile Leu Thr Glu Pro Pro Thr Glu Asp His Leu Leu Gln Asn Thr

545 550 555

Leu Trp Pro Glu Val Gln Lys Leu Tyr Gly His Gly Tyr Glu Ile

560 565 570

Phe Cys Val Thr Cys Asn Ser Ser Lys Thr Leu Leu Ala Ser Ala

575 580 585

Cys Lys Ala Ala Lys Lys Glu His Ala Ala Ile Ile Leu Trp Asn

590 595 600

Thr Thr Ser Trp Lys Gln Val Gln Asn Leu Val Phe His Ser Leu

605 610 615

Thr Val Thr Gln Met Ala Phe Ser Pro Asn Glu Lys Phe Leu Leu

620 625 630

Ala Val Ser Arg Asp Arg Thr Trp Ser Leu Trp Lys Lys Gln Asp

635 640 645

Thr Ile Ser Pro Glu Phe Glu Pro Val Phe Ser Leu Phe Ala Phe

650 655 660

Thr Asn Lys Ile Thr Ser Val His Ser Arg Ile Ile Trp Ser Cys

665 670 675

Asp Trp Ser Pro Asp Ser Lys Tyr Phe Phe Thr Gly Ser Arg Asp

680 685 690

Lys Lys Val Val Val Trp Gly Glu Cys Asp Ser Thr Asp Asp Cys

695 700 705

Ile Glu His Asn Ile Gly Pro Cys Ser Ser Val Leu Asp Val Gly

710 715 720

Gly Ala Val Thr Ala Val Ser Val Cys Pro Val Leu His Pro Ser

725 730 735

Gln Arg Tyr Val Val Ala Val Gly Leu Glu Cys Gly Lys Ile Cys

740 745 750

Leu Tyr Thr Trp Lys Lys Thr Asp Gln Val Pro Glu Ile Asn Asp

755 760 765

Trp Thr His Cys Val Glu Thr Ser Gln Ser Gln Ser His Thr Leu

770 775 780

Ala Ile Arg Lys Leu Cys Trp Lys Asn Cys Ser Gly Lys Thr Glu

785 790 795

Gln Lys Glu Ala Glu Gly Ala Glu Trp Leu His Phe Ala Ser Cys

800 805 810

Gly Glu Asp His Thr Val Lys Ile His Arg Val Asn Lys Cys Ala

815 820 825



<210> 27

<211> 618

<212> PRT

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 3029920CD1 

<400> 27 

Met Lys Lys Asp Val Arg Ile Leu Leu Val Gly Glu Pro Arg Val

1 5 10 15

Gly Lys Thr Ser Leu Ile Met Ser Leu Val Ser Glu Glu Phe Pro

20 25 30

Glu Glu Val Pro Pro Arg Ala Glu Glu Ile Thr Ile Pro Ala Asp

35 40 45

Val Thr Pro Glu Arg Val Pro Thr His Ile Val Asp Tyr Ser Glu

50 55 60

Ala Glu Gln Ser Asp Glu Gln Leu His Gln Glu Ile Ser Gln Ala

65 70 75

Asn Val Ile Cys Ile Val Tyr Ala Val Asn Asn Lys His Ser Ile

80 85 90

Asp Lys Val Thr Ser Arg Trp Ile Pro Leu Ile Asn Glu Arg Thr

95 100 105

Asp Lys Asp Ser Arg Leu Pro Leu Ile Leu Val Gly Asn Lys Ser

110 115 120

Asp Leu Val Glu Tyr Ser Ser Met Glu Thr Ile Leu Pro Ile Met

125 130 135

Asn Gln Tyr Thr Glu Ile Glu Thr Cys Val Glu Cys Ser Ala Lys

140 145 150

Asn Leu Lys Asn Ile Ser Glu Leu Phe Tyr Tyr Ala Gln Lys Ala

155 160 165

Val Leu His Pro Thr Gly Pro Leu Tyr Cys Pro Glu Glu Lys Glu

170 175 180

Met Lys Pro Ala Cys Ile Lys Ala Leu Thr Arg Ile Phe Lys Ile

185 190 195

Ser Asp Gln Asp Asn Asp Gly Thr Leu Asn Asp Ala Glu Leu Asn

200 205 210

Phe Phe Gln Arg Ile Cys Phe Asn Thr Pro Leu Ala Pro Gln Ala

215 220 225

Leu Glu Asp Val Lys Asn Val Val Arg Lys His Ile Ser Asp Gly

230 235 240

Val Ala Asp Ser Gly Leu Thr Leu Lys Gly Phe Leu Phe Leu His

245 250 255

Thr Leu Phe Ile Gln Arg Gly Arg His Glu Thr Thr Trp Thr Val

260 265 270

Leu Arg Arg Phe Gly Tyr Asp Asp Asp Leu Asp Leu Thr Pro Glu

275 280 285

Tyr Leu Phe Pro Leu Leu Lys Ile Pro Pro Asp Cys Thr Thr Glu

290 295 300

Leu Asn His His Ala Tyr Leu Phe Leu Gln Ser Thr Phe Asp Lys

305 310 315

His Asp Leu Asp Arg Asp Cys Ala Leu Ser Pro Asp Glu Leu Lys

320 325 330

Asp Leu Phe Lys Val Phe Pro Tyr Ile Pro Trp Gly Pro Asp Val

335 340 345

Asn Asn Thr Val Cys Thr Asn Glu Arg Gly Trp Ile Thr Tyr Gln

350 355 360

Gly Phe Leu Ser Gln Trp Thr Leu Thr Thr Tyr Leu Asp Val Gln

365 370 375

Arg Cys Leu Glu Tyr Leu Gly Tyr Leu Gly Tyr Ser Ile Leu Thr

380 385 390

Glu Gln Glu Ser Gln Ala Ser Ala Val Thr Val Thr Arg Asp Lys

395 400 405

Lys Ile Asp Leu Gln Lys Lys Gln Thr Gln Arg Asn Val Phe Arg

410 415 420

Cys Asn Val Ile Gly Val Lys Asn Cys Gly Lys Ser Gly Val Leu

425 430 435

Gln Ala Leu Leu Gly Arg Asn Leu Met Arg Gln Lys Lys Ile Arg

440 445 450

Glu Asp His Lys Ser Tyr Tyr Ala Ile Asn Thr Val Tyr Val Tyr

455 460 465

Gly Gln Glu Lys Tyr Leu Leu Leu His Asp Ile Ser Glu Ser Glu

470 475 480

Phe Leu Thr Glu Ala Glu Ile Ile Cys Asp Val Val Cys Leu Val

485 490 495

Tyr Asp Val Ser Asn Pro Lys Ser Phe Glu Tyr Cys Ala Arg Ile

500 505 510

Phe Lys Gln His Phe Met Asp Ser Arg Ile Pro Cys Leu Ile Val

515 520 525

Ala Ala Lys Ser Asp Leu His Glu Val Lys Gln Glu Tyr Ser Ile

530 535 540

Ser Pro Thr Asp Phe Cys Arg Lys His Lys Met Pro Pro Pro Gln

545 550 555

Ala Phe Thr Cys Asn Thr Ala Asp Ala Pro Ser Lys Asp Ile Phe

560 565 570

Val Lys Leu Thr Thr Met Ala Met Tyr Pro His Val Thr Gln Ala

575 580 585

Asp Leu Lys Ser Ser Thr Phe Trp Leu Arg Ala Ser Phe Gly Ala

590 595 600

Thr Val Phe Ala Val Leu Gly Phe Ala Met Tyr Lys Ala Leu Leu

605 610 615

Lys Gln Arg



<210> 28

<211> 596

<212> PRT

<213> Homo sapiens
<220>

<221> misc_feature

<223> Incyte ID No: 3332415CD1 



<400> 28 

Met Glu Pro Glu Leu Asp Ala Gln Lys Gln Pro Arg Pro Arg Arg

1 5 10 15

Arg Ser Arg Arg Ala Ser Gly Leu Ser Thr Glu Gly Ala Thr Gly

20 25 30

Pro Ser Ala Asp Thr Ser Gly Ser Glu Leu Asp Gly Arg Cys Ser

35 40 45

Leu Arg Arg Gly Ser Ser Phe Thr Phe Leu Thr Pro Gly Pro Asn

50 55 60

Trp Asp Phe Thr Leu Lys Arg Lys Arg Arg Glu Lys Asp Asp Asp

65 70 75

Val Val Ser Leu Ser Ser Leu Asp Leu Lys Glu Pro Ser Asn Lys

80 85 90

Arg Val Arg Pro Leu Ala Arg Val Thr Ser Leu Ala Asn Leu Ile

95 100 105

Ser Pro Val Arg Asn Gly Ala Val Arg Arg Phe Gly Gln Thr Ile

110 115 120

Gln Ser Phe Thr Leu Arg Gly Asp His Arg Ser Pro Ala Ser Ala

125 130 135

Gln Lys Phe Ser Ser Arg Ser Thr Val Pro Thr Pro Ala Lys Arg

140 145 150

Arg Ser Ser Ala Leu Trp Ser Glu Met Leu Asp Ile Thr Met Lys

155 160 165

Glu Ser Leu Thr Thr Arg Glu Ile Arg Arg Gln Glu Ala Ile Tyr

170 175 180

Glu Met Ser Arg Gly Glu Gln Asp Leu Ile Glu Asp Leu Lys Leu

185 190 195

Ala Arg Lys Ala Tyr His Asp Pro Met Leu Lys Leu Ser Ile Met

200 205 210

Ser Glu Glu Glu Leu Thr His Ile Phe Gly Asp Leu Asp Ser Tyr

215 220 225

Ile Pro Leu His Glu Asp Leu Leu Thr Arg Ile Gly Glu Ala Thr

230 235 240

Lys Pro Asp Gly Thr Val Glu Gln Ile Gly His Ile Leu Val Ser

245 250 255

Trp Leu Pro Arg Leu Asn Ala Tyr Arg Gly Tyr Cys Ser Asn Gln

260 265 270

Leu Ala Ala Lys Ala Leu Leu Asp Gln Lys Lys Gln Asp Pro Arg

275 280 285

Val Gln Asp Phe Leu Gln Arg Cys Leu Glu Ser Pro Phe Ser Arg

290 295 300

Lys Leu Asp Leu Trp Ser Phe Leu Asp Ile Pro Arg Ser Arg Leu

305 310 315

Val Lys Tyr Pro Leu Leu Leu Lys Glu Ile Leu Lys His Thr Pro

320 325 330

Lys Glu His Pro Asp Val Gln Leu Leu Glu Asp Ala Ile Leu Ile

335 340 345

Ile Gln Gly Val Leu Ser Asp Ile Asn Leu Lys Lys Gly Glu Ser

350 355 360

Glu Cys Gln Tyr Tyr Ile Asp Lys Leu Glu Tyr Leu Asp Glu Lys

365 370 375

Gln Arg Asp Pro Arg Ile Glu Ala Ser Lys Val Leu Leu Cys His

380 385 390

Gly Glu Leu Arg Ser Lys Ser Gly His Lys Leu Tyr Ile Phe Leu

395 400 405

Phe Gln Asp Ile Leu Val Leu Thr Arg Pro Val Thr Arg Asn Glu

410 415 420

Arg His Ser Tyr Gln Val Tyr Arg Gln Pro Ile Pro Val Gln Glu

425 430 435

Leu Val Leu Glu Asp Leu Gln Asp Gly Asp Val Arg Met Gly Gly

440 445 450

Ser Phe Arg Gly Ala Phe Ser Asn Ser Glu Lys Ala Lys Asn Ile

455 460 465

Phe Arg Ile Arg Phe His Asp Pro Ser Pro Ala Gln Ser His Thr

470 475 480

Leu Gln Ala Asn Asp Val Phe His Lys Gln Gln Trp Phe Asn Cys

485 490 495

Ile Arg Ala Ala Ile Ala Pro Phe Gln Ser Ala Gly Ser Pro Pro

500 505 510

Glu Leu Gln Gly Leu Pro Glu Leu His Glu Glu Cys Glu Gly Asn

515 520 525

His Pro Ser Ala Arg Lys Leu Thr Ala Gln Arg Arg Ala Ser Thr

530 535 540

Val Ser Ser Val Thr Gln Val Glu Val Asp Glu Asn Ala Tyr Arg

545 550 555

Cys Gly Ser Gly Met Gln Met Ala Glu Asp Ser Lys Ser Leu Lys

560 565 570

Thr His Gln Thr Gln Pro Gly Ile Arg Arg Ala Arg Asp Lys Ala

575 580 585

Leu Ser Gly Gly Lys Arg Lys Glu Thr Leu Val

590 595



<210> 29 
<211> 589 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 4031536CD1 

<400> 29 

Met Ser Lys Pro Gly Lys Pro Thr Leu Asn His Gly Leu Val Pro

1 5 10 15

Val Asp Leu Lys Ser Ala Lys Glu Pro Leu Pro His Gln Thr Val

20 25 30

Met Arg Ile Phe Ser Ile Ser Ile Ile Ala Gln Gly Leu Pro Phe

35 40 45

Cys Arg Arg Arg Met Lys Arg Lys Leu Asp His Gly Ser Glu Val

50 55 60

Arg Ser Phe Ser Leu Gly Lys Lys Pro Cys Lys Val Ser Glu Tyr

65 70 75

Thr Ser Thr Thr Gly Leu Val Pro Cys Ser Ala Thr Pro Thr Thr

80 85 90

Phe Gly Asp Leu Arg Ala Ala Asn Gly Gin Gly Gin Gin Arg Arg 
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95 








Arg 


He 


Thr 


Ser 


Val 

110 


Gin 


Pro 


Pro 


Lys 


Met 


Phe 


Gin 


Ser 
125 


Trp 


Ser 


Gly 


Asp 


Glu 


Leu 


He 


Asp 
140 


Ser 


Cys 


Glu 


Met 


Gin 


Val 


He 


Glu 
155 


Pro 


Gin 


Phe 


Leu 


Pro 


Lys 


Glu 


Leu 
170 


Ala 


Leu 


Tyr 


Lys 


Asp 


Leu 


Leu 


Gin 
185 


Ala 


Ala 


Gin 


Leu 


Ala 


Glu 


Asp 


Asn 

200 


Leu 


Leu 


Trp 


Gly 


He 


Asp 


Glu 


Pro 
215 


Leu 


His 


He 


Pro 


Gly 


Phe 


He 


His 
230 


Ser 


Pro 


Trp 


His 


Arg 


He 


Asp 


Thr 
245 


Asn 


Trp 


Arg 


Lys 


Val 


Leu 


Lys 


Gly 
260 


His 


Asp 


Asp 


Phe 


Cys 


Gly Asn 




He 


Val 


Ser 










275 








Lys 


Val 


Trp 


Ser 


Ala 
290 


Val 


Thr 


Gly 


Gly 


His 


Thr 


Gly 


Gly 
305 


Val 






lie 


He 


Ser 


Gly 


Ser 
320 


Thr 


Asp 


Arg 


Glu 


Thr 


Gly 


Glu 


Cys 
335 


He 


His 


Thr 


Val 


Arg 


Cys 


Met 


His 
350 




His 


Glu 


Arg 


Asp 


Ala 


Thr 


365 




Val 


Trp 


Leu 


His 


Val 


Leu 


Met 
380 


Gly 


His 


Val 


Tyr 


Asp 


Gly 




395 


Val 


Val 


Ser 


Lys 


Val 


Trp 


Asp 


Pro 
410 


Glu 


Thr 


Glu 


Gly His 


Thr 


Asn 


Arg 


Val 


Tyr 


Ser 










425 








Val 


Val 


Ser 


Gly 


Ser 
440 


Leu 


Asp 


Thr 


Glu 


Thr 


Gly Asn 


Cys 


He 


His 


Thr 










455 








Thr 


Ser 


Gly Met 


Glu 


Leu 


Lys 


Asp 










470 








Ala 


Asp 


Ser 


Thr 


Val 
485 


Lys 


He 


Trp 


Leu 


Gin 


Thr 


Leu 


Gin 
500 


Gly 


Pro 


Asn 





100 










105 


Thr 


Gly 
115 


Leu 


Gin 


Glu 


Trp 


Leu 
120 


Pro 


Glu 
130 


Lys 


Leu 


Leu 


Ala 


135 


Pro 


Thr 
145 


Gin 


Val 


Lys 


His 


Met 
150 


Gin 


Arg 
160 


Asp 


Phe 


He 


Ser 


Leu 
165 


Val 


Leu 
175 


Ser 


Phe 


Leu 


Glu 


Pro 
180 


Thr 


Cys 

190 


Arg 


Tyr 


Trp 


Arg 


He 

195 


Arg 


Glu 

205 


Lys 


Cys 


Lys 


Glu 


Glu 
210 


Lys 


Arg 
220 


Arg 


Lys 


Val 


He 


Lys 
225 


Lys 


Ser 
235 


Ala 


Tyr 


He 


Arg 


Gin 
240 


Arg 


Gly 
250 


Glu 


Leu 


Lys 


Ser 


Pro 
255


His 


Val 
265 


He 


Thr 


Cys 


Leu 


Gin 
270 


Gly 


Ser 
280 


Asp 


Asp 


Asn 


Thr 


Leu 

285 


Lys 


Cys 
295 


Leu 


Arg 


Thr 


Leu 


Val 
300 


Ser 


Gin 
310 


Met 




Asp 


Asn 


He 
315 


Thr 


Leu 
325 


Lys 


Val 


Trp 


Asn 


Ala 
330 


Leu 


Tyr 


Gly His 


Thr 


Ser 


Thr 




340 










345 


Lys 


Arg 
355 


Val 


Val 


Ser 


Gly 


Ser 
360 


Asp 


He 


Glu 


Thr 


Gly Gin 


Cys 




370 










375 


Ala 


Ala 
385 


Val 


Arg 


Cys 


Val 


Gin 
390 


Gly 


Ala 
400 


Tyr 


Asp 


Phe 


Met 


Val 
405 


Thr 


Cys 
415 


Leu 


His 


Thr 


Leu 


Gin 
420 


Leu 


Gin 
430 


Phe 


Asp 


Gly 


He 


His 
435 


Ser 


He 
445 


Arg 


Val 


Trp 


Asp 


Val 
450 


Leu 


Thr 
460 


Gly 


His 


Gin 


Ser 


Leu 
465 


Asn 


He 
475 


Leu 


Val 


Ser 


Gly 


Asn 
480 


Asp 


He 


Lys 


Thr Gly Gin 


Cys 




490 










495 


Lys 


His 
505 


Gin 


Ser 


Ala 


Val 


Thr 
510
Cys Leu Gln Phe Asn Lys Asn Phe Val Ile Thr Ser Ser Asp Asp

515 520 525

Gly Thr Val Lys Leu Trp Asp Leu Lys Thr Gly Glu Phe Ile Arg

530 535 540

Asn Leu Val Thr Leu Glu Ser Gly Gly Ser Gly Gly Val Val Trp

545 550 555

Arg Ile Arg Ala Ser Asn Thr Lys Leu Val Cys Ala Val Gly Ser

560 565 570

Arg Asn Gly Thr Glu Glu Thr Lys Leu Leu Val Leu Asp Phe Asp

575 580 585

Val Asp Met Lys



<210> 30 

<211> 3375 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 708398CB1 

<400> 30 

ggggagaagg gagcccgcag gatcaggggt cagagttagg gggcttccct cctgctgcac 60
cctcatctca gggccgccaa cttccagctg cagcggcgac tttcagtttc atttccacgg 120
accctcctgc ctgggcccgc agccgccgcc gcgatgccca gtaagttcag ctgccggcag 180
ctccgggagg cgggccagtg tttcgagagt ttcctggtcg ttcggggact ggacatggag 240
acagatcgcg agcggctgcg gaccatttat aaccgcgact tcaagatcag ctttgggacc 300
cccgcccctg gcttctcctc catgctgtat ggaatgaaga ttgcaaatct ggcctacgtc 360
accaagactc gggtcaggtt cttcagactc gaccgctggg ccgacgtgcg gttcccagaa 420
aagaggagaa tgaagctggg gtcagatatc agcaaacacc acaagtcact gctagccaag 480
atcttttatg acagggctga gtatcttcat gggaaacatg gtgtggatgt ggaagtccag 540
gggccccatg aagcccgaga tgggcagctc cttatccgcc tggatttgaa ccgcaaagag 600
gtgctgaccc tgaggcttcg gaatggcgga acccagtctg ttaccctcac tcacctcttc 660
ccactctgcc ggacacccca gtttgctttc tacaatgaag accaggagtt gccctgtcca 720
ctgggccccg gtgaatgcta tgaactccat gtccattgta agaccagctt tgtgggctac 780
ttcccagcca cagtgctctg ggagctgctg ggacctgggg agtcgggttc agaaggagcc 840
ggcacattct acattgcccg cctcttggct gccgtcgccc acagccccct ggctgcacag 900
ctgaagccca tgactccctt caagcggacc cggatcaccg gaaaccctgt ggtgaccaat 960
cggatagagg aaggagagag acctgaccgc gctaagggct atgacctgga gttaagtatg 1020
gcgctgggga catactaccc acctccccgc ctcaggcagc tgctccccat gcttcttcag 1080
ggaacaagta tcttcactgc ccctaaggag atcgcagaga tcaaggccca gctggagaca 1140
gccctgaagt ggaggaacta tgaggtgaag ctgcggctgc tgctgcacct ggaggaactg 1200
cagatggagc atgatatccg gcactatgac ctggagccgg tgcccatgac ctgggaccct 1260
gtggaccaga accccaggct gctcacgctg gaggttcctg gagtgactga gagccgcccc 1320
tcagtgctac ggggcgacca cctgtttgcc cttttgtcct cggagacaca ccaggaggac 1380
cccatcacat ataagggctt tgtgcacaag gtggaattgg accgtgtcaa gctgagcttt 1440
tccatgagcc tcctgagccg ctttgtggat gggctgacct tcaaggtgaa ctttaccttc 1500
aaccgccagc cgctgcgagt ccagcaccgt gccctggagc tgacagggcg ctggctgctg 1560
tggcccatgc tctttcctgt ggcacctcgg gacgtcccgc tgctgccctc agatgtgaaa 1620
ctcaagctgt acgaccggag tctggagtca aacccagagc agctgcaggc catgaggcac 1680
attgttacgg gcaccacccg tccagccccc tacatcatct ttgggcctcc aggcaccggc 1740
aagactgtca cgttagtgga ggcaattaag caggtggtga agcacttgcc caaagcccac 1800
atcttggcct gcgctccatc caactcaggg gctgacctac tctgtcaaag gctccgggtc 1860
caccttccta gctccatcta ccgcctcctg gcccccagca gggacatccg catggtacct 1920
gaggacatca agccctgctg caactgggac gcaaagaagg gggagtatgt atcccccgcc 1980
aagaagaagc tgcaggaata ccgggtctta attaccaccc tcatcactgc cggcaggttg 2040
gtctcggccc agtttcccat tgatcacttc acacacatct tcatcgatga ggccggccac 2100
tgcatggagc ctgagagtct ggtagctata gcagggctga tggaagtaaa ggaaacaggt 2160
gatccaggag ggcagctggt gctggcagga gaccctcggc agctggggcc tgcgctgcgt 2220
tccccactga cccagaagca tggactggga tactcactgc tggagcggct gctcatctac 2280
aactccctgt acaagaaggg ccctgatggc tatgaccccc agttcataac caagctgctc 2340
cgcaactaca ggtctcatcc caccatcctg gacattccta accagctcta tcatgaaggg 2400
gagctgcagg cctgtgctga tgtcgtggat cgagaacgct tctgccgctg ggcgggccta 2460
cctcgacagg gctttcccat catctttcac ggcgtaatgg gcaaagatga gcgcgaaggc 2520
aacagcccat ccttcttcaa ccctgaagag gctgccacag tgacttccta cctgaagctg 2580
ctcctggccc cctcctccaa gaagggcaaa gctcgcctga gccctcgaag tgtgggcgtc 2640
atctccccgt accggaaaca ggtggagaaa atccgttact gcatcaccaa acttgacagg 2700
gagcttcgag gactggatga catcaaggac ttgaaggtgg gttcagtaga agaattccaa 2760
ggccaagaac gaagcgtcat cctcatctcc accgtgcgaa gcagccagag ctttgtgcag 2820
ctggatctgg actttaatct gggtttcctt aagaacccca agaggttcaa tgtagctgtg 2880
acccgggcca aggccctgct catcatcgtg gggaaccccc ttctcctggg ccatgaccct 2940
gactggaaag tattcctgga gttctgtaaa gaaaacggag ggtataccgg gtgtcccttc 3000
cctgccaaac tggacctgca acagggacag aatttactgc aaggtctgag caagctcagc 3060
ccctctacct cagggcccca cagccatgac tacctccccc aggagcggga gggtgaaggg 3120
ggcctgtctc tgcaagtgga gccagagtgg aggaatgagc tctgaagaca cagcacccag 3180
ccttctcgca ccagccaagc cttaactgcc tgcctgaccc tgaaccagaa cccagctgaa 3240
ctgcccctcc aagggacagg aaggctgggg gagggagttt acaacccaag ccattccacc 3300
ccctcccctg ctggggagaa tgacacatca agctgctaac aattggggga aggggaagga 3360
agaaaactct gaaac 3375



<210> 31 

<211> 2434 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 1259937CB1 

<400> 31 

caaggatccg atgggtatat ggagtgtgag gtaatggatc attcatgtgg aaggatgcag 60
ggggtttttg agaccagggt ttggaagaga gttcagcact gctggtagtt ttgggaatca 120
cccatgtgca ggcgacacat gaggcagtaa ggaactctgc aggggtccct gagatttgga 180
aatgtaggga agagcaatgg attgaggtcc gaacctggag gatctgctat acgcagagct 240
gggaggaggg acagagtcag taccagagtc ggaaaaaagc agggtgggaa ggggaacctg 300
agtcaggaga cttgcctggc aggcgctgcc ctgccagcag aggcctgaca gtggtttcca 360
tgaactgcat ccctgctgtg ggctgggaca gggccactga cacagtatcg gagcacagaa 420
ggggaaagga gcaggaggga ttccaactct gccagttagc agctgtgtgg ctttgggcat 480
gttacttaac ctctctgagc ctcatttatt tcatccataa aatggaaata aaaataatac 540
ttttgtcaaa ggcgcattgt gaatatttag atcctcagaa taatgcctgg cttgtagcaa 600
atggtagctg gaggaaaagg aagagaaaac caaagtcagc agctgaagga ttttcatatt 660
agaactgctc tggacctatc tggcagatgc agaagcacac acacacggag gggcatggat 720
ttgccccgcc cttagacatg ttgtgtcttc tcctggatcc ttggtcccag gtgccttacc 780
tgagctcagg tgaatgtggc aagcagagcc ctctggtggt gtgaatgctg tgtggcgccc 840
gtgctcctgg tgacacaggg acctcacaat ccctccctcc acggtctcct ctcatgtcct 900
cccagcctta ttctctcgtt cctcttcctc ccaggcccgg aacttgcctg tttggctccc 960
caaccaggac gagccccttc ctggcagcag ctgtgccatc caagttgggg ataaagtccc 1020
ctatgacatc tgccggccag accactcagt gttgaccctg cagctgcctg tgacagcctc 1080
cgtgagagag gtgatggcag cgttggccca ggaggatggc tggaccaagg ggcaggtgct 1140
ggtgaaggtc aattctgcag gtgatgccat tggcctgcag ccagatgccc gtggtgtggc 1200
cacatctctg gggctcaatg agcgtctctt tgttgccaac ccacaggaag tgcatgagct 1260
gatcccacac cctgaccagc tggggcccac tgtgggctct gctgaggggc tggacctggt 1320
gagtgccaag gacctggcag gccagctgac ggaccacgac tggagcctct tcaacagtat 1380
ccaccaggtg gagctgatcc actatgtgct gggcccccag catctgcggg atgtcaccac 1440
cgccaacctg gagcgcttca tgcgccgctt caatgagctg cagtactggg tggccaccga 1500
gctgtgtctc tgccccgtgc ccggcccccg ggcccagctg ctcaggaagt tcattaagct 1560
ggcggcccac ctcaaggagc agaagaatct caattccttc tttgccgtca tgtttggcct 1620
cagcaactcg gccatcagcc gcctagccca cacctgggag cggctgcctc acaaagtccg 1680
gaagctgtac tccgccctcg agaggctgct ggatccctca tggaaccacc gggtataccg 1740
actggccctc gccaagctct cccctcctgt catccccttc atgccccttc ttctcaaaga 1800
catgaccttc attcatgagg gaaaccacac actagtggag aatctcatca actttgagaa 1860
gatgagaatg atggccagag ccgcgcggat gctgcaccac tgccgaagcc acaaccctgt 1920
gcctctctca ccactcagaa gccgagtttc ccacctccac gaggacagcc aggtggcgag 1980
gatttccaca tgctcggagc agtccctgag cacccggagt ccagccagca cctgggctta 2040
tgtccagcag ctgaaggtca ttgacaacca gcgggaactc tcccgcctct cccgagagct 2100
ggagccatga ggaggggctg ggactggagc tggagcaggc acttgcagcc gggaaagcca 2160
gggtgtgccg ggccaagata ctcacaggct ggccacagct gggcaaggct ctccgtggag 2220
tggactcgag tccctggagc aggcagtgtg gaggcagcca tcccctgtga tgactggcag 2280
ctaaggagga cctcggagtg gacccgagcc aggaataacg aatgacccaa ggccaaggaa 2340
gggaggacag agaggcccca ggagtgggtg gagagtggag tgcgctggga cgttgtgtgc 2400
aatagagagg tctccacacc agaaaaaaaa aaaa 2434



<210> 32 

<211> 892 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 1452285CB1 

<400> 32 

cacgcctctc tctgcaccta cacttgcgct ccccaagtct ctctcgtgcg cagagcccag 60
gctgcgcttc cctggtcagg cacggcacgt ctggccggcc gccaggatgc aggccccgca 120
caaggagcac ctgtacaagt tgctggtgat tggcgacctg ggcgtgggga agaccagtat 180
catcaagcgc cacgtgcacc agaacttctc ctcgcactac cgggccacaa tcggcgtgga 240
cttcgcgctc aaggtgctcc actgggaccc ggagactgtg gtgcgcctgc agctctggga 300
tatcgcaggt caagaaagat ttggaaacat gacgagggtc tattaccgag aagctatggg 360
tgcatttatt gtcttcgatg tcaccaggcc agccacattt gaagcagtgg caaagtggaa 420
aaatgatttg gactccaagt taagtctccc taatggcaaa ccggtttcag tggttttgtt 480
ggccaacaaa tgtgaccagg ggaaggatgt gctcatgaac aatggcctca agatggacca 540
gttctgcaag gagcacggtt tcgtaggatg gtttgaaaca tcagcaaagg aaaatataaa 600
cattgatgaa gcctccagat gcctggtgaa acacatactt gcaaatgagt gtgacctaat 660
ggagtctatt gagccggacg tcgtgaagcc ccatctcaca tcaaccaagg ttgccagctg 720
ctccggctgt gccaaatcct agtaggcacc tttgctggtg tctggtagga atgacctcat 780
tgttccacaa attgtgcctc tatttttacc attttgggta aacgtcagga tagagatacc 840
acatgtggca agccaaagat ctatgcctcc atatgtgcct ttctgttagc tt 892



<210> 33 
<211> 2288
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 1812894CB1 

<400> 33 

cggctcgagc ccagatggag gcaacagtac taggttgcag gacttgctaa gccggtaggt 60
ggcctggggt aagagatggg aagagaggct gatccctgtc ccccacagca ctggaggact 120
cccggtccca ggaaggggca aatggtgagg ccgagtcagg tgagctcagc cggcttcggg 180
ctgagctggc aggcgccctg gcagaaatgg aaaccatgaa ggctgtggca gaggtgagcg 240
agagcacgaa ggccgaggct gtggctgcgg tgcagcggca gtgccaagag gaggtggcct 300
cgctgcaggc catcctgaaa gactccatca gcagctatga agcccagatc accgccctga 360
agcaggagcg acagcagcag cagcaggact gtgaggagaa ggagcgggag ctgggccgcc 420
tgaagcagct gctgtcccgg gcctaccccc tggactccct ggagaagcag atggaaaagg 480
cccacgagga ctcggagaag ctgcgggaga tcgtactgcc catggaaaag gagatcgagg 540
agctgaaggc gaagctgctg agggccgagg agctgattca ggagatccag agacgtcccc 600
ggcatgcccc ttccctgcac ggctccacgg agttgctgcc cctgtcccgg gatccatcgc 660
ccccgctgga gcctctggag gagctgagcg gagatggggg tccagccgct gaggccttcg 720
ctcacaactg cgatgacagc gcctccatct cctccttctc ccttggcggt ggggtcggca 780
gcagctcctc cctgccccaa agccgccagg gcctgagccc tgaacaggaa gagacggcct 840
cgctggtgtc tacgggcacc ctggttcccg agggcatcta cctgccccct cctggctacc 900
agctcgtccc agacactcag tgggagcagc tgcagacaga gggccgacag ctgcagaagg 960
acctggagag cgtcagtcgc gagcgggacg agctccaaga gggcctgaga cggagcaatg 1020
aggactgtgc caagcagatg caggtgctcc tggcccaggt ccagaactca gagcagctgc 1080
tgcggaccct gcaagggacc gtgagccagg cccaggagcg ggtgcagctg cagatggcgg 1140
agctggtcac cacccacaag tgcctgcacc atgaggtaaa gcggttgaat gaggaaaacc 1200
aagggctccg ggccgagcag ctgccatcct cagcccccca gggctcgcag caggagcagg 1260
gcgaggagga atcactgccc agctctgtgc cagagctgca gcagctgctg tgctgcacgc 1320
ggcaagaggc gagggcccgg ctgcaggccc aggagcacgg ggccgagcgc ctgcggatcg 1380
agatcgtgac gctgcgggag gctctggagg aggagacagt ggccagggcc agcctggagg 1440
ggcagctgag ggtgcagcgg gaggagacag aggtgctgga ggcctccctg tgcagcctga 1500
ggacagagat ggagcgggtg cagcaggaac agagcaaggc ccagctccca gacctcctct 1560
cagaacagag ggccaaggtg ctgcggctgc aggcagagct ggagaccagt gagcaggtgc 1620
agagggattt cgtgcgactg tcccaggccc tgcaggtgcg cctagagcgg atccgccagg 1680
ctgagaccct ggagcaagtg cgcagcatca tggatgaggc gccactcacg gacgtcaggg 1740
acatcaagga cacctgaggg gtcaggatat ccccaccccc accctgggaa agacgccttt 1800
ccocactcct gaaccatgag gcctcgctct ggggtcttgg atggcttttc caccgtccct 1860
gagactgggg ttgaggggac tgacgggggc caccaccgcc ccgccctcca gcgcctcctc 1920
ccagggtggc tgggcctcct gttctcaggg atcacacctg ggtgaggggc ccaagcccct 1980
cccggaacca aaggtgcagg ctcaggcctg cggctttctg gctgctgtgc tgcctcctgg 2040
gctccagccc tcccctgccc ccagcccgtc ccctgcccag ggcacagcgg agccatgggg 2100
gctgggagtc cccatcagag gcagtgaggt gggccccggc cctgggacag gcagctgcct 2160
tctggtctgc atgacactaa gacgcttgtc cacagcggcg acccaggcct ccaagcttgc 2220
acagaggcaa ggccagactt ttccgtcgtt tattttcaat aaataagcag ctcagcgcaa 2280
aaaaaaaa 2288



<210> 34 
<211> 1813 
<212> DNA 

<213> Homo sapiens 
<220>

<221> unsure 

<222> 1708, 1711, 1713, 1715 

<223> a or g or c or t, unknown, or other



<220> 

<221> misc_feature

<223> Incyte ID No: 3074884CB1 



<400> 34 

gcacgaagga ggcccggctt caggtggccc tggcggagat gccggaagat gcggacgaga 60
acgccgagga ggagctgctg cggggagagc ctctgctgcc ggcggggacc cagcgcgtgt 120
gtctggttca ccctgacgtc aagtggggcc cggggaagtc gcagatgact cgagccgagt 180
ggcaggtggc ggaggccaca gcgctggtgc acacgctgga cggctggtcc gtggtgcaga 240
caatggtcgt gtccaccaaa acgccggaca ggaagctcat ctttggcaaa gggaactttg 300
agcacctgac agaaaagatc cgagggtctc cagacgtcac gtgcgtcttc ctgaacgtgg 360
agaggatggc tgccccgacc aagaaagaac tggaagccgc ctggggcgtg gaggtgtttg 420
accgcttcac ggtcgtcctg cacatcttcc gctgtaacgc ccgcacgaag gaggcccggc 480
ttcaggtggc cctggcggag atgccgctgc acaggtcgaa cttgaaaagg gacgtcgccc 540
acctgtaccg aggagtcggc tcgcgctaca tcatggggtc aggagaatcc ttcatgcagc 600
tgcagcagcg tctcctgaga gagaaggagg ccaagatcag gaaggccttg gacaggcttc 660
gcaagaagag gcacctgctc cgccggcagc ggacgaggcg ggagttcccc gtgatctccg 720
tggtggggta caccaactgc ggaaagacca cgctgatcaa ggcactgacg ggcgatgccg 780
ccatccagcc acgggaccag ctgtttgcca cgctggacgt cacggcccac gcgggcacgc 840
tgccctcacg catgaccgtc ctgtacgtgg acaccatcgg cttcctctcc cagctgccgc 900
acggcctcat cgagtccttc tccgccaccc tggaagacgt ggcccactcg gatctcatct 960
tgcacgtgag ggacgtcagc caccccgagg cggagctcca gaaatgcagc gttctgtcca 1020
cgctgcgtgg cctgcagctg cccgccccgc tcctggactc catggtggag gttcacaaca 1080
aggtggacct cgtgcccggg tacagcccca cggaaccgaa cgtcgtgccc gtgtctgccc 1140
tgcggggcca cgggctccag gagctgaaag ctgagctcga tgcggcggtt ttgaaggcga 1200
cggggagaca gatcctcact ctccgtgtga ggctcgcagg ggcgcagctc agctggctgt 1260
ataaggaggc cacagttcag gaggtggacg tgatccctga ggacggggcg gccgacgtga 1320
gggtcatcat cagcaactca gcctacggca aattccggaa gctctttcca ggatgaacgg 1380
acgcccacag aggcctgcgg ggtgggggca tcgctgcctg gggagctgag gcgttaccgc 1440
tgtgttgggg gcagcttggt gtcaggtgca gcagggtcct ccttgtctgg ttctgcaccc 1500
gtctcgctcc cagccatttg ctgggatgac cgtgcaggcc ggtgacacgg ccgcacctgc 1560
cccaaagcgg gccgcccgag cgcccactcc aagcctgagc atccacacaa ttccagtggg 1620
ccctcggtgc ctgctgtgaa ctgctttccc tcggaatgtt tccgtaacag gacattaaac 1680
ctttgttttt acttccgtga aaaaaaanac ngngnaaaaa aaaaaggggc ggcccgctcc 1740
tagaggattc caagccttac cgtaacgcgt tgcattggcg agcggtcata agcttcttct 1800
aatagggggt cac 1813



<210> 35 

<211> 1733 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 3452277CB1 



<400> 35 

ctacggcctg gaccgagtga ccaatccgaa tgaagtcaag gtaaaccaga aacaaacagt 60

cgttgctgtc aaaaaagaga tcatgtatta ccaacaggcc ttgatgaggt ccacagtgaa 120

gtcttctgtg tccctgggag ggattgtgaa atacagngag cagttctcat ccaacgatgc 180

catcatgtca ggctgcctcc ccagcaaccc ctggatcacc gatgacaccc agttctggga 240 

cttaaatgcc aaactggtgg aaatcccaac caagacgcga gtggaacgat gggccttcaa 300 

cttcagcgaa ttgatccgag accccaaagg tcgacagagc ttccagtact tcctcaagaa 360 

agaattcagt ggagagaatc tgggattctg ggaagcctgc gaggatctga agcatggaga 420

tcagtccaaa gtcaaggaga aagcagagga gatttacaag ctgttcctgg ccccgggggc 480 

gaggcgctgg atcaacatag atggcaaaac catggacatc acagtgaagg ggctgaagca 540

cccccaccgc tatgtgctgg acgccgcaca aacccacatt tacatgctca tgaagaagga 600

ttcttatgct cgctatttaa aatctccgat ctataaggac atgctggcca aagctattga 660 

acctcaggaa accaccaaga aaagctccac cctccctttt atgcggcgtc acctgcgctc 720 

cagcccaagc cctgtcatcc tgagacagct ggaagaggaa gccaaggccc gagaagcagc 780

caacactgtg gacatcaccc agccgggcca gcacatggct cccagccccc acctgaccgt 840 

gtacaccggg acctgcatgc ccccgtctcc ttctagcccc ttctcctoct cctgccgctc 900 

ccccaggaag cctttcgcct cacccagccg cttcatccgg cgacccagca ccaccatctg 960

cccctcaccc atcagagtgg ccttggagag ctcatcgggc ttggagcaga aaggggagtg 1020 

cagcgggtcc atggcccccc gtgggccctc tgtcaccgag agcagcgagg cctccctcga 1080 

cacctcctgg cctcgcagcc ggcccagggc ccctcctaag gcccgcatgg ctctgtcctt 1140 

cagcaggttt ctgagacgag gctgtctggc ctcacctgtc tttgccaggc tctcacccaa 1200 

gtgccctgct gtgtcccacg ggagggtgca gcccctgggg gacgtgggcc agcagctgcc 1260 

acgattgaaa tccaagagag tagcaaactt tttccagatc aaaatggatg tgcccacggg 1320 

gagcgggacc tgcttgatgg actcggagga tgctggaaca ggagagtcgg gtgaccgggc 1380 

cacagaaaag gaggtcatct gcccctggga gagcctgtaa ggaaagaggc aggctgagct 1440 

gggggctctg gaccaggaag atgctctgac agatgccatg gtatgggcca caggacacac 1500

ttgctcgaga accaaagtgc atttgggtga catttgaaga ttggggagac aagatggggt 1560 

agattgtggc aaagaacgct ctggctggtt accaggggcc aactccttct cctcttcctg 1620 

accctccctc ccctgggcag aagaaacgca tgtggaccag aagactttcc ctgctgcctt 1680 

aaaaccaata aaaggttaac tttaagtttc ttggaaaaaa aaaaaaaaag ggg 1733 



<210> 36 

<211> 1776 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 4203832CB1

<400> 36 

cccagccgag cccagcctag cccgagccca gcccgagcga accgcgagcc ccaagcccga 60 

gccgcgccca gcccgagcag agccctccag ccgctcaccc cgcgtgccac cccagcgacc 120 

ctcagccgct ctctgccctt ctctcggccc cgcgcccgcc ctcgcggccc ctctgcccaa 180

tgaaactggc cgcgatgatc aagaagatgt gcccgagcga ctcggagctg agtatcccgg 240

ccaagaactg ctatcgcatg gtcatcctcg gctcgtccaa ggtgggcaag acggccatcg 300

tgtcgcgctt cctcaccggc cgcctcgagg acgcctacac gcctaccatc gaggacttcc 360 

accgcaagtt ctactccatc cgcggcgagg tctaccagct cgacatcctc gacacgtccg 420 

gcaaccaccc gttccccgcc atgcggtgcc tctccatcct cacaggagac gttttcatcc 480 

tggtgttcag tctggacaac cgcgactcct tcgaggaggt gcagcggctc aggcagcaga 540 

tcctcgacac caagtcttgc ctcaagaaca aaaccaagga gaacgtggac gtgcccctgg 600 

tcatctgcgg caacaagggt gaccgcgact tctaccgcga ggtggaccag cgcgagatcg 660 

agcagctggt gggcgacgac ccccagcgct gcgcctactt cgagatctcg gccaagaaga 720 

acagcagcct ggaccagatg ttccgcgcgc tcttcgccat ggccaagctg cccagcgaga 780 

tgagcccaga cctgcaccgc aaggtctcgg tgcagtactg cgacgtgctg cacaagaagg 840

cgctgcggaa caagaagctg ctgcgggccg gcagcggcgg cggcggcggc gacccgggcg 900 

acgcctttgg catcgtggca cccttcgcgc gccggcccag cgtacacagc gacctcatgt 960

acatccgcga gaaggccagc gccggcagcc aggccaagga caaggagcgc tgcgtcatca 1020

gctaggagcc ccgccgcgct ggcgacacaa cctaaggagg acctttttgt taagtcaaat 1080

ccaacggccc ggtgcgcccc aggccgggag cgcgcgcgga ctggcgtctc ccctcccggc 1140

gatccgcccc cagcactggg gaggcgccac tgaaccgaga agggacggtc atctgctccg 1200

gaaggaaaga gaacgggcca agactgggac tattccccac ccccggtccc ccattgaggc 1260

ccgccacccc cataactttg ggagcgaggg cccagccgag ggtggattta tcttctcaaa 1320

gacctaagag tgagcgcggg gtgggggagg gatgtgaagt tatccagcct ctgctaggct 1380

tcaagaaacc gtcatgcccg cttgagggtc aggacccacg gggcattatc ttgtctgtga 1440

ttccgggttg ctgtgacagc cggtagagcc tctgccctcc cgaaactaag cgggggggcg 1500

tgggtcaaat catagccaag tgacttgttt acatgtgagt gaaactgcac aaaggaacac 1560

aaaacaaaac ttgcacttta acggtagttc cggtgtcaac atggacacga acaaaacctt 1620

acccaggtgt ttatactgtg tgtgtgtgag gtctttaaag ttattgcttt atttggtttt 1680

ttaatataca ataaaataat ttaaaatgga aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1740

aaaaaaaaaa aaaaaaaaaa 1776



<210> 37 

<211> 1316

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 104368CB1 

<400> 37 

cggacgcgtg gggcggcttt tctctttagc gctcagggcg ctgacccact gctcttcctc 60
ttaagaaagt gctccattcc ttccggcgcc cggagctgct ggcccaaagg gatccggagc 120
gagctagggc agacaccatg accacccttg atgataagtt gctgggggag aaactgcagt 180
actactatag cagcagtgag gatgaggaca gtgaccacga ggacaaggac cgaggcagat 240
gtgccccagc cagcagttct gtgcctgcag aggctgagct ggcaggcgaa ggcatctcag 300
ttaacacagg cccaaaaggt gtgatcaatg actggcgccg cttcaagcag ttggagacag 360
agcagaggga ggagcagtgc cgggagatgg aaaggctgat caagaagctg tcaatgactt 420
gcaggtccca tctggatgaa gaggaggagc aacagaaaca gaaagacctc caggagaaga 480
tcagtgggaa gatgactctg aaggagtttg ccataatgaa tgaggaccaa gatgatgaag 540
agtttctgca gcagtaccgg aagcagcgaa tggaagagat gcggcagcag cttcacaagg 600
ggccccaatt caagcaggtt tttgagatct ccagtggaga agggttttta gacatgattg 660
ataaagaaca gaaaagcatt gtcatcatgg ttcatattta tgaggatggc attccaggga 720
ccgaagccat gaatggttgc atgatctgcc ttgccgcaga gtacccagct gtcaagttct 780
gcaaggtgaa gagctcagtt attggcgcca gcagtcagtt caccaggaat gcccttcctg 840
ccctgctgat ctataagggg ggtgaattga tcggcaattt tgttcgtgtt actgaccagc 900
tgggggatga tttctttgct gtggaccttg aagcttttct ccaggaattt ggattactcc 960
cagaaaagga agtcttggtg ctgacatctg tgcgtaactc tgccacgtgt cacagtgagg 1020
atagcgacct ggaaatagat tgaactgata gtctagttgc atagatttct cattgtttgg 1080
gttggaatac acgtcattgt ttatttttgt tcctttgtct tctggctttt cagctgttct 1140
ttgtagtccc ttttattatg cataaaataa agaaattctt agattaaatc agaatgctga 1200
ataaccttgt agctagcaat aaggtgactt acagttgtat aaacaggaag ccaggctttt 1260
gaactgttta cttaagattc tgtggtgtga catctctgtt attgtttcca gtcaat 1316



<210> 38 

<211> 1554 

<212> DNA 

<213> Homo sapiens 
<220>

<221> misc_feature

<223> Incyte ID No: 1441680CB1



<400> 38 

gtggcttgtg gagtggcgac cgttagtgag gcggttgctg agacagacgc tgaggcgggt 60
aggaggagcc cgagccgtaa gggaagccgt gatgagggcc gtgttgacgt ggagagataa 120
agccgagcac tgtataaatg acaccgcatt taagcctgat ggaactcaac tgattttggc 180
tgccggaagc agattactgg tttatgacac ctctgatggc accttacttc agcccctcaa 240
gggacacaaa gacactgtgt actgtgtggc atatgcgaag gatggcaagc gctttgcttc 300
tggatcagct gacaaaagcg ttattatctg gacatcaaaa ctggaaggca ttctgaagta 360
cacgcacaat gatgctatac aatgtgtctc ctacaatcct attactcatc aactggcatc 420
ttgttcctcc agtgactttg ggttgtggtc tcctgaacag aagtctgtct ccaaacacaa 480
atcaagcagc aagatcatct gctgcagctg gacaaatgat ggtcagtacc tggcgctggg 540
gatgttcaat gggatcatca gcatacggaa caaaaatggc gaggagaaag taaagatcga 600
gcggccgggg ggctccctct cgccaatatg gtccatctgc tggaaccctt caagagagga 660
acgtaatgac atcctggctg tggctgactg gggacagaaa gtttccttct accagctgag 720
tggaaaacag attggaaagg atcgggcact gaactttgac ccctgctgca tcagctactt 780
tactaaaggc gagtacattt tgctgggggg ttcagacaag caagtatctc ttttcaccaa 840
ggatggagtg cggcttggga ctgttgggga gcagaactcc tgggtgtgga cgtgtcaagc 900
gaaaccggat tccaactatg tggtggtcgg ctgccaggac ggcaccattt ccttctacca 960
gcttattttc agcacagtcc acggagttta caaggaccgc tatgcctaca gggatagcat 1020
gactgacgtc attgtgcagc acctgatcac tgagcagaaa gttcggatta aatgcaaaga 1080
gcttgtcaag aagattgcca tctacagaaa tcgattggct atccaactgc cagagaaaat 1140
cctcatctat gagttgtatt cagaggactt atcagacatg cattaccggg taaaggagaa 1200
gattatcaag aagtttgagt gcaacctcct ggtggtgtgt gccaatcaca tcatcctgtg 1260
ccaggagaaa cggctgcagt gcctgtcctt cagcggagtg aaggagcggg agtggcagat 1320
ggagtctctc attcgttaca tcaaggtgat cggtggccct cctggaagag aaggcctctt 1380
agtggggctg aagaagatgt acttgttagt gtattcattc atattgattg taaaggatta 1440
tttttcactc agtactgatg tccttggaaa tcttacctgg aaacatgttt gcaaaaaaca 1500
ttattgggtc tttcatcttt tttcttggta ttacatattt gttcaataaa aata 1554



<210> 39 

<211> 2320 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 1494955CB1



<400> 39 

gggcgcccgc gtactcacta gctgaggtgg cagtggttcc accaacatgg agctctcgca 60 

gatgtcggag ctcatggggc tgtcggtgtt gcttgggctg ctggccctga tggcgacggc 120

ggcggtagcg cgggggtggc tgcgcgcggg ggaggagagg agcggccggc ccgcctgcca 180 

aaaagcaaat ggatttccac ctgacaaatc ttcgggatcc aagaagcaga aacaatatca 240

gcggattcgg aaggagaagc ctcaacaaca caacttcacc caccgcctcc tggctgcagc 300 

tctgaagagc cacagcggga acatatcttg catggacttt agcagcaatg gcaaatacct 360 

ggctacctgt gcagatgatc gcaccatccg catctggagc accaaggact tcctgcagcg 420 

agagcaccgc agcatgagag ccaacgtgga gctggaccac gccaccctgg tgcgcttcag 480 

ccctgactgc agagccttca tcgcctggct ggccaacggg gacaccctcc gtgtcttcaa 540 

gatgaccaag cgggaggatg ggggctacac cttcacagcc accccagagg acttccctaa 600

aaagcacaag gcgcctgtca tcgacattgg cattgctaac acagggaagt ttatcatgac 660

tgcctccagt gacaccactg tcctcatctg gagcctgaag ggtcaagtgc tgtctaccat 720

caacaccaac cagatgaaca acacacacgc tgctgcatct ccctgtggca gazttgtagc 780

ctcgtgtggc ttcaccccag atgtgaaggt ttgggaagtc tgctttggaa agaaggggga 840

gttccaggag gtggtgcgag ccttcgaact aaagggccac tccgcggctg tgcactcgtt 900

tgctttctcc aacgactcac ggaggatggc ttccgtctcc aaggatggta catggaaact 960

gtgggacaca gatgtggaat acaagaagaa gcaggacccc tacttgctga agacaggccg 1020

ctttgaagag gcggcgggtg ccgcgccgtg ccgcctggcc ctctccccca acgcccaggt 1080

cttggccttg gccagtggca gtagtattca tctctacaat acccggcggg gcgagaagga 1140

ggagtgcttt gagcgggtcc atggcgagtg tatcgccaac ttgtcctttg acatcactgg 1200

ccgctttctg gcctcctgtg gggaccgggc ggtgcggctg tttcacaaca ctcctggcca 1260

ccgagccatg gtggaggaga tgcagggcca cctgaagcgg gcctccaacg agagcacccg 1320

ccagaggctg cagcagcagc tgacccaggc ccaagagacc ctgaagagcc tgggtgccct 1380

gaagaagtga ctctgggagg gcccggcgca gaggattgag gaggagggat ctggcctcct 1440

catggcgctg ctgccatctt tcctcccagg tggaagcctt tcagaaggag tcccctggtt 1500

ttcttactgg tggccctgct tcttcccatt gaaaccactc ttgtctactt aggtctctct 1560

cttcttgctg gctgtgactc ctccctgact agtggccaag gtgcttttct tcctcccagg 1620

cccagtgggt ggaatctgtc cccacctggc actgaggaga atggtagaga gcagaggaga 1680

gagagagaga atgtgatttt tggccttgtg gcagcacatc ctcacaccca aagaagtttg 1740

taaatgttcc agaacaacct agagaacacc tgagtactaa gcagcagttt tgcaaggatg 1800

ggagactggg atagcttccc atcacagaac tgtgttccat caaaaagaca ctaagggatt 1860

tccttctggg cctcagttct atttgtaaga tggagaataa tcctctctgt gaactccttg 1920

caaagatgat atgaggctaa gagaatatca agtccccagg tctggaagaa aagtagaaaa 1980

gagtagtact attgtccaat gtcatgaaag tggtaaaagt gggaaccagt gtgctttgaa 2040

accaaattag aaacacattc cttgggaagg caaagttttc tgggacttga ccatacattt 2100

tatatggttg ggacttctct cttcgggaga tgatatcttg tttaaggaga cctcttttca 2160

gttcatcaag ttcatcagat atttgagtgc ccactctgtg cccaaataaa tatgagctgg 2220

ggattaaata cgaataagac atggtttctg ccatcaaaga tggctggtgg gagagagaga 2280

tacaccctta ttaagtgctt tgtgttagtt tattcatagc 2320



<210> 40 

<211> 879 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1508161CB1 

<400> 40 

gaagaatttg ttcaggcgtt cgtgcagaag gaccctttgg ataatgacaa gagttgctac 60
agtgaacgga agaaaacacg aaacttagaa gcttacgtgg aatggtttaa tcgcctcagc 120
tacttggttg ctacagaaat ctgtatgcct gttaagaaaa aacaccgagc aagaatgatt 180
gagtatttca ttgacgtagc tcgggagtgt tttaacattg gcaacttcaa ctccttgatg 240
gcgataatct ctggtatgaa tatgagccca gtctctcgac taaaaaaaac ttgggccaaa 300
gtgaagactg caaaatttga cattcttgag catcagatgg acccttcaag caatttctat 360
aattatcgaa cagctcttcg tggggcagca caaaggtctt taactgctca tagtagtaga 420
gaaaagattg tgataccatt cttcagtctc ttaatcaaag atatttattt cctcaatgag 480
ggttgtgcca accgccttcc caatggccat gtcaattttg agaaattttg ggaactggcc 540
aaacaagtga gtgaatttat gacatggaaa caagtggagt gtccatttga gagggaccgg 600
aagatcttgc agtatctgct cacagtacca gtcttcagtg aagatgctct ctacttggct 660
tcttatgaga gtgaaggacc tgaaaatcat atagagaaag acagatggaa gtctttaagg 720
tcgagcctct taggcagagt ttaacacatg ggagctgcct gcctgctgct gctgctgctt 780
cctgcagatc atggaggggc tggcctttgt tttctggcat ctcgtaccac gaacactcat 840
gaagaccctg cagtcattgg agcacccggg tcagcaaag 879



<210> 41

<211> 2248 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 1811877CB1 

<400> 41 

ggaggaccag gaggacatca ctgcctttga cctcagccct gacaacgagg tgctggtgac 60
agccagtcgg gcattgctgc tggctcagtg ggcctggcaa gagggcagcg ttacccgcct 120
gtggaaggcg atacacacgg cccccgtggc caccatggcc ttcgacccca cctccactct 180
gctagccaca ggtggctgtg atggggccgt gcgcgtctgg gacatcgtgc ggcactacgg 240
gacacaccac ttccgaggct cgcccggtgt cgtgcaccta gtggccttcc acccggaccc 300
tacacgcctg ctgctcttct cctcggccac ggatgccgcc atccgcgtgt ggtcactgca 360
ggaccggtca tgcctggctg tgctgactgc ccactacagc gccgtcacct cactggcctt 420
cagcgccgac ggccacacca tgctcagctc cggccgtgac aagatatgta tcatctggga 480
ccttcagagc tgccaggcca cgaggaccgt gcctgtgttt gagagcgtgg aggctgctgt 540
gctgttgcca gaggagccag tgtcccagct gggtgtgaag tccccagggc tgtactttct 600
gacagctggc gaccaaggca ctctgcgcgt gtgggaggca gcttctgggc agtgtgtgta 660
cacgcaggcc cagccgccgg gccctgggca ggagctgacc cactgcaccc tggcacacac 720
cgccggcgtg gtcctcaccg ccaccgccga ccacaacctg ttgctctacg aggctcgctc 780
cctgcggctg cagaaacagt tcgctggcta cagtgaggag gttttggatg tccggtttct 840
tgggcccgag gactcccacg ttgtcgtggc ctccaatagc ccctgcctaa aagtgtttga 900
gctgcagacg tcagcctgcc agatcctcca cggccacacg gatatcgtcc tggccctgga 960
tgtgttccgg aaggggtggc tctttgccag ctgtgccaag gatcagagcg tccgtatctg 1020
gagaatgaac aaggctggcc aggtgatgtg cgtggctcag ggttccggtc acacacacag 1080
tgtgggcacc gtctgctgct ctaggctgaa ggagtccttc ctggtgacag gcagccagga 1140
ctgcactgtg aagctgtggc ctcttcccaa agccttgctg tccaagaaca cagccccaga 1200
caacggccct atcctcctgc aggcccagac cactcagcgc tgccatgata aggacatcaa 1260
cagcgtggct attgccccca acgacaagct gctggccaca ggctcacagg accgcacggc 1320
caagctctgg gccctgccac agtgccagct gctgggtgtc ttctcaggcc accggcgtgg 1380
cctctggtgc gtccagttct ctcccatgga ccaggtgctg gccacggcct cagctgatgg 1440
caccatcaag ctctgggcac tccaggactt cagctgtctc aagacatttg aggggcacga 1500
tgcttctgtg ctgaaggtgg cctttgtgag ccgtggcacg cagctgctgt ccagcggttc 1560
ggatggcctc gtgaagctct ggaccatcaa gaacaacgag tgtgtgcgga cgctggatgc 1620
ccacgaggac aaggtctggg ggctgcactg cagccggctg gacgaccacg cccccactgg 1680
ggccagtgac tcccgagtca tcctctggaa ggatgtgacc gaggcggagc aggcagagga 1740
gcaggccagg caagaggagc aggtggtcag gcagcaagag ctggacaacc tgctgcatga 1800
gaagcggtac ctgcgggcgc tgggcctggc catctccctg gatcggcccc acaccgtgct 1860
gactgtcatc caggccatcc ggagggaccc tgaggcctgc gagaagctgg aagccaccat 1920
gctccgactg cggcgcgacc agaaagaggc cctgctgcgc ttctgcgtca cgtggaacac 1980
caactcgcgg cactgccacg aggcccaggc cgtgctgggt gtgctcttga ggcgagaggc 2040
ccccgaggag ctgctggcct acgaaggcgt gcgggcagcg cttgaggccc tgctgcccta 2100
cactgagcgg cactttcagc ggctcagcag gaccctccag gccgccgctt tcttggactt 2160
cctgtggcac aacatgaagc tccctgtgcc ggccgccgcc cccaccccct gggaaaccca 2220
taaaggcgca ctgccctaaa aaaaaaaa 2248



<210> 42 
<211> 2146 
<212> DNA 

<213> Homo sapiens 
<220>

<221> misc_feature

<223> Incyte ID No: 1848674CB1 



<400> 42 

gttattggca agttcccctg cagttgtttg ggctgtccct gtggctggtt ctggggtgtg 60
cggccagcca tggagcgctc tgggcccagc gaagtgacag gctcagacgc gtcgggaccg 120
gacccgcagc ttgcggtcac catgggcttc acggggttcg gtaaaaaagc tcgcacattt 180
gacttggaag caatgtttga acaaactcga aggacagctg tggaaagaag tcgcaaaaca 240
ctggaagcaa gagaaaaaga ggaagaaatg aacagagaga aagaattaag aagacaaaat 300
gaagatattg agccaacatc ctcaagatca aatgtggtca gagattgctc caaatcatct 360
tccagggata cgagcagcag tgaaagtgaa cagagttctg actcttctga tgatgagtta 420
attggccctc ctttaccccc taaaatggta ggaaaaccag ttaattttat ggaggaagat 480
atcctcggtc ctttacctcc acctcttaat gaagaagaag aagaagcaga ggaagaagaa 540
gaggaagagg aggaagagga aaatcctgtt cacaagattc ctgactcgca tgagataacg 600
ctgaagcatg gcactaaaac agtgtctgct ttgggtctgg atccctcagg tgcccgtttg 660
gtgacaggag gatatgacta tgatgttaag ttttgggatt ttgctggaat ggatgcttct 720
tttaaggcat ttcgatccct tcagccctgt gagtgccatc agatcaagtc attacagtat 780
agtaacacag gagacatgat tcttgttgta tctggaagct ctcaggccaa ggtgattgac 840
agagatggtt ttgaagtaat ggaatgtata aaaggagacc agtatattgt ggacatggcc 900
aacaccaagg gtcatacagc aatgcttcat actggctcat ggcatcccaa aataaaggga 960
gaatttatga cttgctcaaa tgatgcgact gtgaggacgt gggaagttga aaatccaaag 1020
aagcaaaaaa gtgtgtttaa accacggacg atgcaaggca aaaaagtcat tcccactacg 1080
tgcacatata gtagagatgg aaacctcata gcagctgcct gccagaatgg aagcatacag 1140
atctgggacc gaaatttgac tgttcatcct aagttccact ataaacaggc tcatgactcg 1200
ggcacagaca cttcttgcgt gactttttcc tatgatggta atgtccttgc ctctcgtgga 1260
ggtgacgatt cattaaaatt atgggacatc cgacaattta ataaaccact tttttcagcc 1320
tcgggtcttc ccaccatgtt cccaatgact gactgctgtt tcagtccaga tgataagctc 1380
atagtcactg gtacatctat tcaaagagga tgtggcagcg gcaaacttgt tttctttgag 1440
cgtaggactt tccaaagggt gtatgaaata gacatcacag atgcgagtgt tgttcgctgc 1500
ctgtggcatc caaagctgaa ccagatcatg gttggaactg gaaatggatt ggctaaagtc 1560
tattacgacc ccaacaagag tcagagggga gcaaaattat gtgtggttaa aacccagcgg 1620
aaggcaaaac aagctgagac tctaactcag gactacatca tcacccctca tgccttgcct 1680
atgttccgtg agccccgcca acggagtaca aggaaacagc tggagaagga cagactggat 1740
cccctgaagt cgcataaacc tgaacctcct gtagcaggcc caggtcgtgg tggccgagtt 1800
ggaacccacg ggggcactct ctcttcctat attgtgaaga acattgcttt ggacaagacc 1860
gatgacagta atcctcggga agccattttg cgtcatgcca aagcagcaga agacagccca 1920
tattgggttt ctccagcata ttccaagact cagcccaaaa ccatgtttgc ccaagttgaa 1980
tctgatgatg aggaagcaaa gaatgagcca gaatggaaaa aacgtaaaat ttgaagaatc 2040
tcatttgaga gctgtttgca tgagtgggag gggtatggga caggtttggg tttttttttt 2100
atgctcatga aattaaaaat tcatttttat gaaaaaaaaa aaaaaa 2146



<210> 43 
<211> 714 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 2012970CB1 

<400> 43 

ggacggcgag cagcggaggc gaggcggtga cgagagcagc ggctccgcca ttggacgagg 60
aggcctgagg gacgggccag cggtgcacaa gaagagaccg aggcgggtgg ccccgagaga 120
gccagggcca tggaggccaa catgccgaag cggaaggagc ctggcaggtc tctccgtatc 180
aaagtcatct ccatgggcaa cgccgaagtg gggaaaagct gtattacaaa gcgatactgt 240
gagaaaagat tcgtgtctaa atacctggca acaattggaa ttgactatgg agccacaaag 300
gtacacgtca gagacagaga aatcaaagtt aacatctttg atatggccgg acatcccttc 360
ttctatgagg ttcgaaatga gttttacaag gacacacagg gtgtgacact ggtctatgat 420
gttgggcaga aagactcctt tgacgccctt gatgcgtggc tggcagaaat gaagcaagag 480
cttggacctc atggaaacat ggaaaatatt atatttgtag tttgtgccaa caagattgat 540
tgtaccaaac atcgctgtgt agatgaaagt gaaggacgtc tttgggctga aagcaaaggg 600
ttcctgtact ttgaaacttc agcacaaact ggagaaggca ttaatgagat gttccagata 660
catcttggat agaactaatg gataaattag tctgtttaaa aaagaaaaaa aaaa 714



<210> 44 

<211> 1779 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 2254315CB1 

<400> 44 

caggaggaag atggcggcgt ccgcagctgc cgctgagctc caggcttctg ggggtccgcg 60
gcacccagtg tgtctgttgg tgttgggaat ggcgggatcc gggaaaacca cttttgtaca 120
gaggctcaca ggacacctgc atgcccaagg cactccaccg tatgtgatca acctggatcc 180
agcagtacat gaagttccct ttcctgccaa tattgatatt cgtgatactg taaagtataa 240
agaagtaatg aaacaatatg gacttggacc caatggcggc atagtgacct cactcaatct 300
ctttgctacc agatttgatc aggtgatgaa atttattgag aaggcccaga acatgtccaa 360
atatgtgttg attgacacac ctggacagat tgaggtattc acctggtcag cttctgggac 420
aattatcact gaagcccttg catcctcatt tccaacagtt gtcatctatg taatggacac 480
atcgagaagt accaacccag tgaccttcat gtccaacatg ctctatgcct gcagcatctt 540
atacaaaacc aagctgcctt tcattgtggt catgaataaa actgacatca ttgaccacag 600
ctttgcagtg gaatggatgc aggattttga ggctttccaa gatgccttga atcaagagac 660
tacatacgtc agtaacctga ctcgttcaat gagcctggtg ttagatgagt tttacagctc 720
actcagggtg gtgggtgtct ctgctgttct gggtactgga ttagatgaac tctttgtgca 780
agttaccagt gctgccgaag aatatgaaag ggagtatcgt cctgaatatg aacgtctgaa 840
aaaatcactg gccaacgcag agagccaaca gcagagagaa caactggaac gccttcgaaa 900
agatatgggt tctgtagcct tggatgcagg gactgccaaa gacagcttat ctcctgtgct 960
gcacccttct gatttgatcc tgactcgagg aaccttggat gaagaggatg aggaagcaga 1020
cagcgatact gatgacattg accacagagt tacagaggaa agccatgaag agccagcatt 1080
ccagaatttt atgcaagaat cgatggcaca atactggaag agaaacaata aataggagac 1140
tttagcacac ttcacttgtt tctagaagtc cagaattttg gacctccacg tgaaagaact 1200
gttcttacct ctgaactggg ggctcccata agggataatt ttcctcagag tagcaaagtt 1260
tctcttatta gagaaatctt gtgactcaga tgaagtcagg gatagaagac ccttggacct 1320
ggcaggttaa tgctgattat tccttggcct ttcccttgta tttatgcaag gaaggatata 1380
ctgagctgat actcttccaa gcctacaact tcaagtctta tcatttgaac tcaagtactt 1440
ttgctgctga ggaatggaat caaaagaacg tagtctoctg gtgaccacct cagatctcta 1500
ttattaggct agatgtatag cctctactcc cccagcttct tgctcttgac cctgcactgt 1560
aagttgccct tctattagca gccaaggaaa agggaaacat gagcttatcc agaacggtgg 1620
cagagtctcc ttggcaatca accaacgttg ctatgaaata tgcctcacac tgtatagctc 1680
attataggac gtcaggtttg ttgaaaaaag tgggcaagac atgattaatg aatcagaatc 1740
ctgtttcatt ggtgacttgg ataaagactt tttaatttt 1779
<210> 45

<211> 2234 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 2415545CB1

<400> 45

ccccttctga aaaatggttt cacccgctcc ttctctagaa cgggtcgggc cgctaccact 60
gtccggcccg gaggggaact gttttctccg gaagtgacaa cacgctgact aggaaaagga 120
ggaggcgggg cagtggggcc ttcggcggcg actatggaag gagccggcta cagggtggtg 180
tttgagaagg gcggagtgta cctgcacacc agcgctaaga agtatcagga ccgagactct 240
ctcatcgctg gtgtcatccg tgtcgtggaa aaggacaatg acgtcctcct gcactgggct 300
cctgtagagg aggctggaga ttccacccaa atcctctcct ccaagaagga ctccagtggg 360
ggtgactcat gtgcttctga ggaggaacca acctttgacc ccggctatga acctgactgg 420
gctgtcatca gcactgtgcg gccacagccc tgccactcag agcccacgag aggtgcagag 480
cccagctgcc cccagggctc ctgggccttc tcagtgagtc tgggggagct aaagtccatc 540
cgccgctcca agccaggcct cagctgggcc tacctggttc tggtgaccca ggctggaggt 600
tccctgcccg cactgcactt ccaccgcggg ggcacccgcg ccctgctccg cgtcctcagc 660
cgctacctgc tgttggccag ctccccgcag gactcccgcc tctaccttgt cttcccccac 720
gactcctctg ctctctccaa ctccttccac cacctgcagc tctttgacca ggacagctcc 780
aatgtggtgt cacgcttcct ccaggatccc tactccacca ccttcagcag cttctcccga 840
gtgaccaact tcttccgggg tgccctgcag ccacagcctg agggagccgc ctccgacctt 900
cccccgccac ccgacgatga gcccgagcct ggattcgagg tcatttcctg tgtggagctg 960
gggcctcggc caaccgtgga gcggggccct ccagttacag aggaggagtg ggcacgccac 1020
gtgggccctg aaggtcgcct gcagcaggtc cctgagctga agaaccggat cttctcgggg 1080
ggtctgagcc ccagcctgcg gcgcgaggcc tggaagttcc tcctagggta cctcagctgg 1140
gaaggcacag ctgaggagca caaggcccac atacgcaaga aaacggatga gtatttccgc 1200
atgaagctgc agtggaaatc tgtgagccct gagcaggagc ggagaaactc acttctgcat 1260
ggataccgca gcctcatcga aagggatgtg agccgcactg acaggaccaa caagttctac 1320
gagggtcccg agaacccggg gctgggcctg ctgaacgata tcctcctcac ctactgcatg 1380
tatcacttcg acctcggcta cgtccagggc atgagtgatc ttctctcccc gatcctctac 1440
gtcattcaga acgaggtgga tgctttctgg tgtttctgtg gcttcatgga gctcgtgcaa 1500
gggaactttg aagagagcca ggagaccatg aagcggcaac tcgggcgact gctgctgctc 1560
ctgagggtgc tggaccccct gctctgcgac ttcctggatt cccaggactc cggctctctc 1620
tgcttctgtt tccggtggct gctcatctgg ttcaagaggg aattcccctt cccggatgtc 1680
cttcggctgt gggaggtgct gtggacaggg ctccctggcc ccaatctgca cctgctggtg 1740
gcctgcgcca tcctggacat ggagagggac accctcatgc tgtccggctt cggctccaat 1800
gagatcctca agcacatcaa cgagctgact atgaagctga gcgtggagga cgtgctgacc 1860
cgcgccgagg ccctgcaccg ccagctaacc gcctgcaccc gagctgcccc acaacgtgca 1920
ggagatcctg gggctggccc cgccacgcag agccccacag cccctcgccc accgcctccc 1980
cgctgcctct gtacgcccac ccgggccccg cccaccccgc cgccctccac ggacacagcc 2040
ccgcagcccg acagcagcct ggagatcctg cccgaggagg aggacgaggg cgccgactcc 2100
taaccccgcc aggcagcctc gttctgcaca ggcactttag cccgagccag gcacacctgc 2160
gagggggcag gtgtgctccg ccgccctgct gataagctgg cttcattaaa ctgacacttc 2220
tcaaaaaaaa aaaa 2234



<210> 46 

<211> 3150 

<212> DNA 

<213> Homo sapiens 
<220>

<221> unsure 

<222> 36, 97, 99, 3070-3072, 3074, 3078, 3080-3082, 3085-3087, 3091, 
<222> 3099, 3100, 3103, 3107, 3110-3112, 3114, 3115, 3121, 3123, 3125, 
<222> 3128, 3136, 3138, 3140, 3141, 3143, 3145, 3147, 3149 
<223> a or g or c or t, unknown, or other



<220> 

<221> misc_feature

<223> Incyte ID No: 2707969CB1 



<400> 46 

acacaggagc aatgcaaaat ttgataaagc atttttctgt cagatcagct gagccctact 60
gcccttcctc tcaagattcc tggagaccca gatgtnngna tctttcattg acaacaaaat 120
aatgtgtcat gatgatgatg ataaagaccc tgtactccgg gtatttgatt cccgagttga 180
caagatcagg ctgttgaatg ttcggacacc tactctccgt acatccatgt accagaagtg 240
taccactgtg gatgaagcag agaaagcaat tgagctgcgt ctggcaaaaa ttgaccatac 300
tgcaattcac ccacatttac ttgacatgaa gattggacaa gggaaatatg agccgggctt 360
cttccctaag ctgcagtctg atgtactttc cactgggcca gccagcaaca agtggacgaa 420
aaggaatgcc cctgcccagt ggaggcggaa agatcggcag aagcagcaca cagaacacct 480
gcgtttagat aatgaccaga gggagaagta catccaggaa gccaggacta tgggcagcac 540
tatccgccag cccaaactgt ccaacctctc tccatcagtg attgcccaga ccaattggaa 600
gtttgtagag ggcctgctga aggaatgccg caataagacc aagaggatgc tggtggaaaa 660
gatgggccga gaagctgtgg agctagggca tggggaggtg aacatcacag gggtggaaga 720
gaacaccctg attgccagcc tttgtgatct cctggaaagg atctggagtc atggactaca 780
agtgaaacag gggaaatcag ccttatggtc ccacctgtta cattatcagg acaaccggca 840
gagaaaactc acatcaggaa gcctcagtac ctcaggaata cttcttgatt cagaacgtag 900
gaagtctgat gccagctcac tcatgcctcc cctgaggatc tccctgattc aggatatgag 960
gcacatccag aacatcgggg aaatcaagac tgatgtggga aaggccagag catgggtgcg 1020
actgtccatg gaaaaaaagt tactttccag acacctgaag cagctcctct cagaccatga 1080
gctcaccaaa aagttatata agcgctatgc cttcctgcgc tgtgatgacg agaaggagca 1140
gttcctctat cacctcctgt ctttcaatgc cgtcgattac ttttgcttca ccaatgtctt 1200
cacaactatc ctgatcccgt accacattct gatcgtacca agcaagaagc tggggggctc 1260
catgttcact gccaacccat ggatctgtat atcaggagaa ttgggtgaga cacagatcat 1320
gcagattccc aggaatgtgc tagagatgac cttcgagtgc cagaacttgg ggaagcttac 1380
tactgtccag attggccatg ataactctgg gctgtatgcc aaatggctgg tggagtatgt 1440
gatggtcagg aatgagatca caggacatac ctacaagttc ccgtgtggcc ggtggttagg 1500
gaagggcatg gatgatggaa gcctggagcg gatcctagtt ggggagctgc tcacatccca 1560
gcctgaggtg gatgagaggc catgccggac cccgccgctg cagcagtccc ccagtgtcat 1620
ccggaggctt gttaccatct cacccaacaa caagcccaag ctgaacactg ggcagatcca 1680
ggagtccatc ggggaggcag tcaatggcat tgtgaagcac ttccataagc ctgagaaaga 1740
gcgaggcagt ctgacgctgt tgctctgtgg agagtgtggc cttgtctcgg ccttggaaca 1800
ggctttccag catggattta aatcgccccg gctcttcaaa aatgtcttca tttgggattt 1860
cctggaaaaa gcacaaacct attatgagac attagagaag aatgaagtag tccctgagga 1920
aaactggcat acaagagccc ggaacttctg ccgatttgtc actgcaatca acaatactcc 1980
ccggaacatc ggcaaggatg gcaagtttca gatgctggtg tgcttgggag ccagagatca 2040
cctcctacac cactggattg ccctgctggc tgactgcccc atcactgcac acatgtatga 2100
ggatgtggca ctgatcaaag accatacact tgtcaattcc ttgattcgtg tgctgcagac 2160
attgcaggag ttcaacatca cgctggagac gtcccttgtc aagggcatcg acatctgacc 2220
tcccagcacc agccagcagc aggactgaga aagactcacc ctgcagctct gacctttttt 2280
cccaaaggga cttaagcgat tgtgcaggag taggagacaa aatgtacact cactgtaaaa 2340
agagaactag aggatttttg gaataaataa tctattttag agtttatttg ctgatttgct 2400
ttttacacac tttcatgtga aagagtgata gggagaggga gcgaggctgg tgccgcttat 2460
tttgaagctg gtgccctccc tcgccgtggc cacatgctgg aagcctgagg cctccctgga 2520
ctgagcctgt ggcactgcgt gcgggacagt tatgtttcct tgccccgtcg cattaatgag 2580
gcccttccac atcattttta aactaatgtt tttctatatt aacattatta tggatatttg 2640
gctttcatag gccacacaca ggtgtgctgc gcgggaagcc ccatgcccca atcaaaggga 2700
tttttagtag tgcctctaag caagcaccga tgagtcagtc ccacgtattt tcttttttgt 2760
cagtattgtt tgggaaggag acatgccggg atgtgtcatc gtgccaaata ccacatttcc 2820
tgttggcaca gtttcacaga agtaaacata agcatgtttt aacaggtttt tcttttcttt 2880
tttctttttt aaaatgtttt atttatttaa cccgccattg tgtgttttta agtattttct 2940
ttttttaagg aaaggaaaag cttgtcacaa tctaactggc tatgttatta ttattaaatt 3000
tatgttttgc aacttagaaa ccagctacag tatggcccac ttaataaaac acctgaaaca 3060
aaaaaaaagn nngngggngn nngtnnngag naggagggnn ggngggnggn nngnngggag 3120
ntnanttntg gggtgngngn ngngnangnt 3150



<210> 47 

<211> 1806 

<212> DNA

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 2817769CB1 

<400> 47 

gtctgcgcgc aggtgccgct cggcgcccgg gccgcccgtt ccgcggctgt cgccgccgtc 60
gtgcgtgccg ctcggcggag gggacgggcc tgcgttctct cctccttcct ccccgcctcc 120
agctgccggc aggacctttc tctcgctgcc gctgggaccc cgtgtcatcg cccaggccga 180
gcacgatgcc ccctaaaaag ggaggtgatg gaattaaacc acccccaatc attggaagat 240
ttggaacctc actgaaaatt ggtattgttg gattgccaaa tgttgggaaa tctactttct 300
tcaatgtgtt aaccaatagt caggcttcag cagaaaactt cccgttctgc actattgatc 360
ctaatgagag cagagtacct gtgccagatg aaaggtttga ctttctttgt caataccaca 420
aaccagcaag caaaattcct gcctttctaa atgtggtgga tattgctggc cttgtgaaag 480
gagctcacaa tgggcagggc ctggggaatg cttttttatc tcatattagt gcctgtgatg 540
gcatctttca tctaacacgt gcttttgaag atgatgatat cacgcacgtt gaaggaagtg 600
tagatcctat tcgagatata gaaataatac atgaagagct tcagcttaaa gatgaggaaa 660
tgattgggcc cattatagat aaactagaaa aggtggctgt gagaggagga gataaaaaac 720
taaaacctga atatgatata atgtgcaaag taaaatcctg ggttatagat caaaagaaac 780
ctgttcgctt ctatcatgat tggaatgaca aagagattga agtgttgaat aaacacttat 840
ttttgacttc aaaaccaatg gtctacttgg ttaatctttc tgaaaaagac tacattagaa 900
agaaaaacaa atggttgata aaaattaaag agtgggtgga caagtatgac ccaggtgctt 960
tggtcattcc ttttagtggg gccttggaac tcaagttgca agaattgagt gctgaggaga 1020
gacagaagta tctggaagcg aacatgacac aaagtgcttt gccaaagatc attaaggctg 1080
ggtttgcagc actccaacta gaatactttt tcactgcagg cccagatgaa gtgcgtgcat 1140
ggaccatcag gaaagggact aaggctcctc aggctgcagg aaagattcac acagattttg 1200
aaaagggatt cattatggct gaagtaatga aatacgaaga ttttaaagag gaaggttctg 1260
aaaatgcagt caaggctgct ggaaagtaca gacaacaagg cagaaattat attgttgaag 1320
atggagatat tatcttcttc aaatttaaca cacctcaaca accgaagaag aaataaaatt 1380
tagttattgc tcagataaac atacaacttc caaaaggcat ctgattttta aaaaattaaa 1440
atttctgaaa accaatgcga caaataaagt tggggagatg ggaatctttg acaaacaaat 1500
tatttttatt tgttttaaaa ttaaaatact gtgtaccccc ccccactcca tgaaatgcag 1560
gttcactaaa tgtgaacagc tttgcttttc acgtgattaa gaccctactc caaattgtag 1620
aagcttttca ggaaccatat tactctcatg atacttcatt aatctccatc atgtatgcca 1680
agcctgacac atttgacagt gaggacaatg tggcttgctc ctttttgaat ctacagataa 1740
tgcatgtttt acagtactcc agatgtctac actcaataaa acatttgaca aaaccaaaaa 1800
aaaaaa 1806
<210> 48
<211> 2880 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 2917557CB1

<400> 48

gaggaggagg aggaggaaga agaggaagaa gaagatgaag aaagtgaaga agaggaggaa 60
gaggagggag aaagtgaagg cagtgaaggt gatgaggaag atgaaaaggt gtcagatgag 120
aaggattcag ggaagacatt agataaaaag ccaagtaaag aaatgagctc agattctgaa 180
tatgactctg atgatgatcg gactaaagaa gaaagggctt atgacaaagc aaaacggagg 240
attgagaaac ggcgacttga acatagtaaa aatgtaaaca ccgaaaagct aagagcccct 300
attatctgcg tacttgggca tgtggacaca gggaagacaa aaattctaga taagctccgt 360
cacacacatg tacaagatgg tgaagcaggt ggtatcacac aacaaattgg ggccaccaat 420
gttcctcttg aagctattaa tgaacagact aagatgatta aaaattttga tagagagaat 480
gtacggattc caggaatgct aattattgat actcctgggc atgaatcttt cagtaatctg 540
agaaatagag gaagctctct ttgtgacatt gccattttag ttgttgatat tatgcatggt 600
ttggagcccc agacaattga gtctatcaac cttctcaaat ctaaaaaatg tcccttcatt 660
gttgcactca ataagattga taggttatat gattggaaaa agagtcctga ctctgatgtg 720
gctgctactt taaagaagca gaaaaagaat acaaaagatg aatttgagga gcgagcaaag 780
gctattattg tagaatttgc acagcagggt ttgaatgctg ctttgtttta tgagaataaa 840
gatccccgca cttttgtgtc tttggtacct acctctgcac atactggtga tggcatggga 900
agtctgatct accttcttgt agagttaact cagaccatgt tgagcaagag acttgcacac 960
tgtgaagagc tgagagcaca ggtgatggag gttaaagctc tcccggggat gggcaccact 1020
atagatgtca tcttgatcaa tgggcgtttg aaggaaggag atacaatcat tgttcctgga 1080
gtagaagggc ccattgtaac tcagattcga ggcctcctgt tacctcctcc tatgaaggaa 1140
ttacgagtga agaaccagta tgaaaagcat aaagaagtag aagcagctca gggggtaaag 1200
attcttggaa aagacctgga gaaaacattg gctggtttac ccctccttgt ggcttataaa 1260
gaagatgaaa tccctgttct taaagatgaa ttgatccatg agttaaagca gacactaaat 1320
gctatcaaat tagaagaaaa aggagtctat gtccaggcat ctacactggg ttctttggaa 1380
gctctactgg aatttctgaa aacatcagaa gtgccctatg caggaattaa cattggccca 1440
gtgcataaaa aagatgttat gaaggcttca gtgatgttgg aacatgaccc tcagtatgca 1500
gtaattttgg ccttcgatgt gagaattgaa cgagatgcac aagaaatggc tgatagttta 1560
ggagttagaa tttttagtgc agaaattatt tatcatttat ttgatgcctt tacaaaatat 1620
agacaagact acaagaaaca gaaacaagaa gaatttaagc acatagcagt atttccctgc 1680
aagataaaaa tcctccctca gtacattttt aattctcgag atccgatagt gatgggggtg 1740
acggtggaag caggtcaggt gaaacagggg acacccatgt gtgtcccaag caaaaatttt 1800
gttgacatcg gaatagtaac aagtattgaa ataaaccata aacaagtgga tgttgcaaaa 1860
aaaggacaag aagtttgtgt aaaaatagaa cctatccctg gtgagtcacc caaaatgttt 1920
ggaagacatt ttgaagctac agatattctt gttagtaaga tcagccggca gtccattgat 1980
gcactcaaag actggttcag agatgaaatg cagaagagtg actggcagct tattgtggag 2040
ctgaagaaag tatttgaaat catctaattt tttcacatgg agcaggaact ggagtaaatg 2100
caatactgtg ttgtaatatc ccaacaaaaa tcagacaaaa aatggaacag acgtatttgg 2160
acactgatgg acttaagtat ggaaggaaga aaaataggtg tataaaatgt tttccatgag 2220
aaaccaagaa acttacactg gtttgacagt ggtcagttac atgtccccac agttccaatg 2280
tgcctgttca ctcacctctc ccttccccaa cccttctcta cttggctgct gttttaaagt 2340
ttgcccttcc ccaaatttgg atttttatta cagatctaaa gctctttcga ttttatactg 2400
attaaatcag tactgcagta tttgattaac caagcttctg cagattttgt gattcttggg 2460
acttttttga cgtaagaaat acttctttat ttatgcatat tcttcccaca gtgatttttc 2520
cagcattctt ctgccatatg cctttagggc ttttataaaa tagaaaatta ggcattctga 2580
tatttcttta gctgctttgt gtgaaaccat ggtgtaaaag cacagctggc tgctttttac 2640
tgcttgtgta gtcacgagtc cattgtaatc atcacaattc taaaccaaac taccaataaa 2700
gaaaacagac anccaccagt aagcaagctc tgttaggctt ccatgttagt gragcctctc 2760
tcccacaagt tgtcctccta ggacaagaat tatcttacaa actaaactat caccacacta 2820
ccttgtatgc cagcacctgg taacagtaga gatttttata cattaatctt gacccgtttt 2880



<210> 49 

<211> 1109 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 3421335CB1 

<400> 49 

cccacgcgtc cgctcgctct tgggtcatgc ctggccagca gaaagcagct ccatagggga 60
ggagagccac gcaggatctc acagctgcag tctaatagta acacagagga ttcagcagtg 120
gccaccatgg gttctgtgaa ttccagaggt cacaaggcgg aagcccaggt ggtgatgatg 180
ggcctggact cggcgggcaa gaccacgctc ctttacaagc tgaagggcca ccagctggtg 240
gagaccctgc ccactgttgg tttcaacgtg gagcctctga aagctcctgg gcacgtgtca 300
ctgactctct gggacgttgg ggggcaggcc ccgctcagag ccagctggaa ggactatctg 360
gaaggcacag atatcctcgt gtacgtgctg gacagcacag atgaagcccg cttacccgag 420
tcggcggctg agctcacaga agtcctgaac gaccccaaca tggctggcgt ccccttcttg 480
gtgctggcca acaagcagga ggcacctgat gcacttccgc tgcttaagat cagaaacagg 540
ctgagtctag agagattcca ggaccactgc tgggagctcc ggggctgcag tgccctcact 600
ggggaggggc tgcccgaggc cctgcagagc ctgtggagcc tcctgaaatc tcgcagctgc 660
atgtgtctgc aggcgagagc ccatggggct gagcgcggag acagcaagag atcttgatcc 720
agacagagca gcatatcttt gctcatacaa actagaagaa ccagctgatc cttgagaaat 780
ttacgcttag cctatcaaac aagaaatgct ggcttggccc ggtggctcat gcctgtaatc 840
ccagcactgt gggagaccac ggtgggggaa tcccttgagc ccaggagttg gagagcaaca 900
tcacaacacc ccatttctac taataatcaa aaaattggcc gggcatggtg gcatgtgcct 960
gtagtcccag ctacttggga ggctgaggca ggagaatcgc ttgagcccaa gaggtagagg 1020
ttgcagtgag ccaagatcgc gccactgcac tccagtctgg gcaacagagt gagaccctgt 1080
tctagtggtg ataataataa tgatgtagt 1109



<210> 50 

<211> 2407 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 605761CB1 

<400> 50 

ctttcgctgg cgccattacc tgagttctcc tccagcgttt ccgcaccctc tccgattagc 60
ggtcccagga gcttccaagg taaccgcgca gtagggcgga tctcattagg cggaaagcga 120
aacccggaag tgacgctctt accgggtgtc agcagcgaga gggttcgaag atggcggcgc 180
gcaagggtcg gcgtcgcacg tgtgaaaccg gggaacccat ggaagccgag tccggcgaca 240
caagttccga gggcccggcc caggtctacc tgcccggccg ggggccgccg ctacgcgaag 300
gggaggagct ggtcatggac gaggaggcct atgtgctcta ccaccgagcg cagactggcg 360
ccccctgtct cagctttgac atagtccggg atcacctggg agacaaccgg acagagcttc 420
ctcttacact ttacttgtgt gctgggaccc aggctgagag cgcccagagc aacagactga 480
tgatgcttcg gatgcacaat ctgcatggga caaagccccc accctcagag ggcagtgatg 540
aagaagaaga ggaggaagat gaagaggatg aagaagagcg gaaacctcag ctggagctgg 600
ccatggtgcc ccactatggt ggcatcaacc gagttcgggt gtcatggctg ggtgaagagc 660
ctgtggctgg ggtgtggtca gagaagggcc aggtggaggt gtttgcgctg cggcggcttc 720
tgcaggtggt ggaggagccc caggccctgg cagccttcct ccgggatgag caggcccaaa 780
tgaagcccat cttctccttc gctggacaca tgggcgaggg ctttgccctt gactggtccc 840
cccgggtgac cggtcgcctg ctgaccggtg actgtcaaaa gaacatccac ctctggacac 900
ctacggacgg cggctcctgg cacgtggacc agcggccatt cgtgggccac acacgctctg 960
tggaggacct gcagtggtca ccgactgaga acacggtgtt tgcctcctgc tcagctgacg 1020
cctccatccg catctgggac atccgggcag cccccagcaa ggcctgcatg ctcaccacag 1080
ccaccgccca tgatggggac gtcaatgtca tcagctggag ccgccgggag cccttcctgc 1140
tcagtggcgg ggatgatggg gccctcaaga tctgggacct tcggcagttc aagtctggtt 1200
ccccagtggc caccttcaag cagcacgtgg cccccgtgac ctccgtcgag tggcaccccc 1260
aggacagcgg ggtctttgca gcctcgggtg cagaccacca gatcacacag tgggacctgg 1320
cagtggagcg ggaccctgag gcgggcgacg tggaggccga ccccggactg gccgacctcc 1380
cgcagcagct gctgttcgtg caccagggcg agaccgagct gaaggagctg cactggcacc 1440
cgcagtgccc agggctcctg gtcagcacgg cgctgtcagg cttcaccatc ttccgcacca 1500
tcagcgtctg aggcgtccca ctggctctga tcttgcttcc tgcttggaaa ctgaagtcga 1560
attgggctcc cctggaaggg gttcattcag gtctgttgac tgagactggc cggcctgtgg 1620
gctgccgtga tggattctgt ttgacgtatt gttctctaga aggcctggct ctgatccagt 1680
gacccctctc accaaagaac tcggtttaac cagggctctg taagaccact cccacccaga 1740
gacttgtgtg gcctggtgtg gcctgtgtgt cggattcctt cctgtcagct gtgacccatt 1800
tgacctgtgt ccccagaacc cagttttttg tttgtttgtt tgagacggag tcttggtctg 1860
tcgcccaggc tggagtgcag tagcacgatc ttggctcact gcaacctccg cctcctgggt 1920
taaagtgatt ctctcagctc agtctcccag gtagctggga ttacaggcat gtgccaccac 1980
accccgttaa tttttgtatt tttagtagag acggggtttc accatgttgg ccaggctggt 2040
ctcaaattct tgatctcaag tgatctgtcc gccccggcct cccagagtgc tgggttggga 2100
ttacaggcgt gagccaccgc gtccggctca ggacccagtt ttggctgctg gttcccagca 2160
ggggactcgg gggatataca gtggctgcac caaattggag gtgtgggttc ctccaacaca 2220
atttgcttct gcccgttgtc ttcctgccag ctgggtttgg ccaggatttc tccgtgtggg 2280
ggctacatgc gaccctctcc cctcctccct gactttagag gctggtgctg tgtcgggagg 2340
aaggtcaggg ctcctgagca gcaataaagg accaggaaga ggcctgaggt gtaaaaaaaa 2400
aaaaaaa 2407



<210> 51 

<211> 1158 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 483862CB1 

<400> 51 

ggaagaccgt cccggatggc ctcggggact gccagtgtgt ggaggtgagc tccgggattg 60 
ccggcattcc cgcttctgct ggttgcttca tgctgcaggc tgcggccgtc agccctcgct 120 
cgcattggtg gcgctgaggt gccggggcag caagtgacat gtcgtcgggc ctccgcgccg 180 
ctgacttccc ccgctggaag cgccacatct cggagcaact gaggcgccgg gaccggctgc 240 
agagacaggc gttcgaggag atcatcctgc agtataacaa attgctggaa aagtcagatc 300
ttcattcagt gttggcccag aaactacagg ctgaaaagca tgacgtacca aacaggcacg 360
agataagtcc cggacatgat ggcacatgga atgacaatca gctacaagaa atggcccaac 420
tgaggattaa gcaccaagag gaactgactg aattacacaa gaaacgtggg gagttagctc 480
aactggtgat tgacctgaat aaccaaatgc agcggaagga cagggagatg cagatgaatg 540
aagcaaaaat tgcagaatgt ttgcagacta tctctgacct ggagacggag tgcctagacc 600
tgcgcactaa gctttgtgac cttgaaagag ccaaccagac cctgaaggat gaatatgatg 660
ccctgcagat cacttttact gccttggagg gaaaactgag gaaaactacg gaagagaacc 720
aggagctggt caccagatgg atggctgaga aagcccagga agccaatcgg cttaatgcag 780
agaatgaaaa agactccagg aggcggcaag cccggctgca gaaagagctt gcagaagcag 840
caaaggaacc tctaccagtc gaacaggatg atgacattga ggtcattgtg gatgaaactt 900
ctgatcacac agaagagacc tctcctgtgc gagccatcag cagagcagcc acgtaagtag 960
gcaggtttgg gccagggaaa agacagcttg aggagcaata tgaaggcaca tctgtggaca 1020
tgacaaagaa tgcagtcaga tgcacccaac cccttactcc ttttctggga cacccagcgt 1080
cgaacacacc acagaggtgt ctagtctttc tcagttcacc tctgcttaat gggagggaag 1140
cagaacacgg gtggcttc 1158 



<210> 52 

<211> 1026 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 1256777CB1 

<400> 52 

ctgcgggcct ctctccgtcg ccatggaaac gaaagcggcc aagtagagct ccgtcctgac 60
gcgccgcctc ccgtgggctc cggccggcta agccgcggcg gacaactatg ctgaaagcca 120
agatcctctt cgtggggcct tgcgagagtg gaaaaactgt tttggccaac tttctgacag 180
aatcttctga catcactgaa tacagcccaa cccaaggagt gaggatccta gaatttgaga 240
acccgcatgt taccagcaac aacaaaggca cgggctgtga attcgagcta tgggactgtg 300
gtggcgatgc taagtttgag tcctgctggc cggccctgat gaaggatgct catggagtgg 360
tgatcgtctt caatgctgac atcccaagcc accggaagga aatggagatg tggtattcct 420
gctttgtcca acagccgtcc ttacaggaca cacagtgtat gctaattgca caccacaaac 480
caggctctgg agatgataaa ggaagcctgt ctttgtcgcc acccttgaac aagctgaagc 540
tggtgcactc aaacctggaa gatgaccctg aggagatccg gatggaattc ataaagtatt 600
taaaaagcat aatcaactcc atgtctgaga gcagagacag ggaggagatg tcaattatga 660
cctagccagc cttcacctgg gactgccaca tccccagtga aatcagcatg tttctcggtg 720
cagatctgaa atcacatcca gctcctgatg ttttcttctc cctctgactg cagaggaagt 780
gttcctacct gcaggaaggc acctgtcaca cagggcgttc actcagacca tctgtgctct 840
gccctgagtt cagttgagaa aatcctatta tcaaatttgg atttcctggc cccagaactt 900
cccaaagacc tgtaaaatgg agggatttac cacctcacat atgtccagtt aaacagtttg 960
tggacttgta accgtcgcag cccaatgata caacagtagt ttaatcacgt gaaaaaaaaa 1020
aaaaaa 1026



<210> 53 

<211> 2456 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 2198779CB1 

<400> 53 

gacgagcgac gagatgacgg aggagcggtt ggccaacgca tgccggcagt cggtgtaaac 60
aaggcctcgc gccgctgcgg gtcctgcgac cgctcctggc tgggatttcg attggtcggt 120
cagagaggtt acctggaaat ccaacaccgc ccaacacccc tcccgctccc cagtccgggg 180
acttcgatag gattggagaa ggagttgaca ggaggagccc ccgcacagga cctaagaatg 240
ctgtgaccag aagatgggat cgcggaacag cagcagtgca ggatccgggt ccggagaccc 300
ctccgagggc ttgccccgaa gaggggctgg cctgcgtcgg agtgaggaag aggaagaaga 360
ggatgaagat gtggatctgg cccaggtact ggcctatctc ctccgcagag gccaagtgag 420
gttggtgcag ggaggaggtg cagcaaattt acaattcatt caggccctct tggactcaga 480
ggaagagaat gacagagctt gggatggtcg tcttggggat cgatacaacc cacctgtgga 540
tgctacccct gacacccggg agctggaatt caatgagatc aagacacaag tggaactggc 600
cacagggcag ctggggctta ggcgggccgc ccagaagcac agctttcctc gaatgttgca 660
ccagagagaa cggggcctct gccatcgggg aagcttctcc cttggagaac agtctcgagt 720
gatatctcac ttcttgccca atgatctggg cttcactgat agctactctc agaaggcttt 780
ctgtggcatc tacagcaaag atggtcaaat attcatgtct gcttgccaag accagacaat 840
ccgactctat gactgccgat atggccgttt ccgtaaattc aagagcatca aggcccgcga 900
tgtaggctgg agcgtcttgg atgtggcctt cacccctgat gggaaccact tcctctactc 960
tagctggtct gattacattc atatctgcaa tatctatggt gagggagata cacacactgc 1020
cctggatctc aggccagatg agcgtcgctt tgctgtcttc tccattgctg tctcctcaga 1080
tggacgagaa gtactaggag gggccaatga tggctgcctg tatgtctttg accgagaaca 1140
gaaccggcgc acccttcaga ttgagtccca tgaggatgat gtgaatgcag tggcctttgc 1200
tgatataagc tcccaaatcc tgttctctgg gggagatgat gccatctgca aagtgtggga 1260
tcgacgcacc atgcgggagg atgaccccaa gcctgtgggt gcactggctg gacaccagga 1320
tggcatcacc ttcattgaca gcaagggtga tgcccggtat ctgatctcca actctaaaga 1380
ccagaccatc aaactctggg atatccgacg cttttccagc cgggaaggca tggaagcttc 1440
acgccaggct gccacacagc aaaactggga ctatcggtgg cagcaagtgc ccaaaaaagg 1500
gtttactctg catccctacc cagcctggcg gaagctgaag ctcccagggg acagctcctt 1560
gatgacctac cggggccacg gagtgctgca caccctcatc cgctgccggt tctcccccat 1620
tcatagcact ggccagcagt tcatctacag tggctgctcc actggcaaag tggttgtgta 1680
cgaccttcta agtggccaca ttgtgaagaa gctgaccaac cacaaggcct gtgtgcgtga 1740
cgtcagttgg cacccctttg aagagaagat tgtcagcagt tcgtgggacg ggaacctgcg 1800
tctgtggcag taccgccagg ctgagtactt ccaggatgac atgccagaat ctgaggaatg 1860
tgccagcgcc cctgccccag tgccccaatc ctctacaccc ttttcctcac cccagtagat 1920
ccaacctcca gccccatata gggtgaacct cttgataagc tctctgcctc ctcctccctt 1980
tctcccttgt ggggaatgtt tggaggaatc actggcattt gatggggaat aacataagcc 2040
tgggctctga gcctcagctg agccctggaa gattctcccc atggggcaga gtggtctcct 2100
tacgtgctca cacccagtca gcttgggtcc ctatctctgg ccagagtttg gcaggactgc 2160
cattatctgg ggtgtggcct ctgccagcaa gagaagtgtc ctgggtgttt ttaatcatgt 2220
ttgaatgtta ggggttggat cctagagtag atgcctgagg ccacatctga acagacctgt 2280
cagccaggcc tgccaggtct tcacgttgag gattcaactg gccaatcaca ggacaggtgt 2340
cctggccttt cttcctgagg tctctagggg aggggcatgg gtaagggtgt ttcctcagca 2400
ccctcctggg gtggggatta tgtctgctgt catgtctggg tctttaaggt aggaca 2456



<210> 54 

<211> 1771 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 2226116CB1 

<400> 54 

cggctcgagt taatattttc gttgagcgga accttgctat tccataagag gatgtgtcca 60
gtgttgtgga agatttcatg ttttaaatcc tttgtacaga aatcctgctc ccaagtcaca 120
gataggctga cgggtcagag ggcaagacgt gacccagggc cgagagggtg agtgaccagg 180
aaaatcggat tcatcagttc acttgtttgt ttcagaaacg tgcacaaaga cctgctgcat 240
gaggccctcg tcttcagttt ctgtttcatg cccagcatta aaccaagtat ctcattttgc 300
caatttgact tctgtagggg ccatggcacc tgcaaggtgt ttctcagcaa gattgaggac 360
cgtgtttcag ggcgtggggc attgggcttt gtccacatgg gctggcctga agcccagccg 420
gctactgcca cagcgggctt ctcccaggct gctctcggtc ggccgtgcgg acctcgccaa 480
gcatcaggaa ctcccgggga agaagctgct ctctgagaaa aagctgaaaa ggcactttgt 540
ggactatcgg agagtgcttg tctgtggagg aaacggaggc gctggggcaa gctgcttcca 600
cagtgagccc cgcaaggagt ttggaggccc tgatggaggg gacggaggca acggtggaca 660
cgtcattctg agagttgacc agcaagtcaa gtccctgtcg tcggtcctgt cgcggtacca 720
gggtttcagt ggagaagatg gagggagtaa aaactgcttc gggcgcagtg gcgccgtcct 780
ctacatccgg gtccccgtgg gcacgctggt gaaggaggga ggcagagttg tggccgacct 840
gtcttgcgtg ggagatgagt acattgccgc gctgggcggg gcaggaggga aaggcaaccg 900
cttcttcctg gccaacaaca accgtgcccc tgtgacctgt acccctggac agccaggaca 960
gcagcgagtt ctccacctgg agctcaagac ggtggcccac gccggaatgg tgggattccc 1020
caacgccggg aagtcctcac tgctccgggc catttcaaac gccagacccg ccgtggcttc 1080
ctacccgttc accaccctga agccccacgt cgggatcgtc cactacgaag gccacctaca 1140
aatagcagtg gccgacatcc ccggcatcat acgaggcgcc caccagaaca ggggtctggg 1200
gtccgccttc ctcaggcaca tcgagcgctg ccgctttctc ttgttcgtgg tggatctttc 1260
tcagcctgag ccgtggactc aagttgacga tttaaaatat gaactggaga tgtatgaaaa 1320
gggcctgtct gcgaggcccc acgcaatcgt cgcaaacaag attgacctcc ctgaagccca 1380
agccaatctg tcccagctcc gggatcactt gggacaggag gtcatcgtgc tgtcggcgtt 1440
gaccggcgag aacctggagc agctgctgtt gcacctgaag gtgctgtatg acgcctacgc 1500
ggaggccgag ctgggccagg gccgccagcc gctcaggtgg tagccacgcc agagcggggt 1560
cgcctctggg cctctgtctg agcaaacctg ggtgtgaatt cggtggtttt gaatgcataa 1620
agtgccttgt ggacacgggg gagttgtggt gcttctgggt ctctgggccc cgcctgctgg 1680
cctgagatgc cctcatgttg ggaagcattc cgtgcccccc accccgcctg ccctccgtat 1740
ttcctgcacc tgtcagcctg cgctgactga t 1771



<210> 55 

<211> 2724 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2504472CB1 

<400> 55 

gctgaccagt tggcgacatg gtggcacccg tgctggagac ttctcacgtg ttttgctgcc 60

caaaccgggt gcggggagtc ctgaactgga gctctgggcc cagaggactt ctggcctttg 120 

gcacgtcctg ctccgtggtg ctctatgacc ccctgaaaag ggttgttgtt accaacttga 180

atggtcacac cgcccgagtc aattgcatac agtggatttg taaacaggat ggctcccctt 240 

ctactgaatt agtttctgga ggatctgata atcaagtgat tcactgggaa atagaggata 300

atcagctttt aaaagcagtg catcttcaag gccatgaagg acctgtttat gcggtgcatg 360

ctgtttacca gaggaggaca tcagatcctg cattatgtac actgatcgtt tctgcagctg 420 

cagattctgc tgttcgactc tggtctaaaa agggtccaga agtaatgtgc cttcagactt 480 

taaactttgg aaatggattt gctttggctc tctgcttatc ttttttgcca aatactgatg 540

taccaatatt agcatgtggc aatgatgatt gcagaattca catatttgct caacaaaatg 600

atcagtttca gaaagtgctt tctctctgtg gacatgagga ttggattaga ggagtggaat 660 

gggcagcctt tggtagagat cttttcctag caagctgttc acaagattgc ctgataagaa 720 

tatggaagct gtatataaag tcaacatctt tagaaactca ggatgacgat aacataagac 780 

tgaaagaaaa tacttttacc atagaaaatg aaagtgttaa aatagcattt gctgttactc 840 

tggagacagt gctagccggt catgaaaact gggtaaatgc agttcactgg caacctgtgt 900 

tttacaaaga tggtgtccta cagcagccag cgagattatt atctgcttcc atggataaaa 960

ccatgattct ctgggctcca gatgaagagt caggagtttg gctagaacag gttcgagtag 1020

gtgaagtagg tgggaatact ttgggatttt atgattgcca gttcaatgaa gatggctcca 1080 

tgatcattgc tcatgctttc cacggagcgt tgcacctttg gaaacagaat acagttaacc 1140 
caagagagtg gactccagag attgtcattt caggacactt tgatggtgtc caagacctag 1200

tctgggatcc agaaggagaa tttattatca ctgttggtac tgatcagaca actagacttt 1260 

ttgctccatg gaagagaaaa gaccaatcac aggtgacttg gcatgaaatt gcaaggcctc 1320 

agatacatgg gtatgacctg aaatgtttgg caatgattaa tcggtttcag tttgtatctg 1380 

gagcagatga aaaagttctt cgggtttttt ctgcacctcg gaattttgtg gaaaattttt 1440 

gtgccattac aggacaatca ctgaatcatg tgctctgtaa tcaagatagt gatcttccag 1500 

aaggagccac tgtccctgca ttgggattat caaataaagc tgtctttcag ggagatatag 1560 

cttctcagcc ttctgatgaa gaggagctgt taactagtac tggttttgag tatcagcagg 1620

tggcctttca gccctccata cttactgagc ctcccactga ggatcatctt ctgcagaata 1680

ctttgtggcc tgaagttcaa aaactatatg ggcacggtta tgaaatattt tgtgttactt 1740 

gtaacagttc aaagactctg cttgcctcag cttgtaaggc agctaagaaa gagcatgcag 1800 

ctatcattct ttggaacact acatcttgga aacaggtgca gaatttagtt ttccacagtt 1860 

tgacagtcac gcagatggcc ttctcaccta atgagaagtt cttactagct gtttccagag 1920 

atcgaacctg gtcattgtgg aaaaagcagg atacaatctc acctgagttc gagccagttt 1980 

ttagtctttt tgccttcacc aacaaaatta cttctgtgca cagtagaatt atttggtctt 2040

gtgattggag tcctgacagc aagtatttct tcactgggag tcgagacaaa aaggtggttg 2100 

tctggggtga gtgcgactcc actgatgact gtattgagca caacattggc ccctgctcct 2160 

cagtcctgga cgtgggtggg gctgtgacag ctgtcagcgt ctgcccagtg ctccaccctt 2220 

ctcaacgata cgtggttgca gtaggattgg agtgtggaaa gatttgctta tatacctgga 2280 

aaaagactga tcaagttcca gaaataaatg actggaccca ctgtgtagaa acaagtcaaa 2340 

gccaaagtca tacactggct atcagaaaat tatgctggaa gaattgcagt ggaaaaactg 2400

aacagaagga agcagaaggt gctgagtggt tacactttgc aagctgtggt gaagatcaca 2460

ctgtgaagat acacagagtc aataaatgtg cactgtaatg gacttaataa ctacatgctt 2520 

gcagtcactg gtatcttaaa atattatcat gtaaacaggt catctttacc ttcataactg 2580 

aattgagttt ctgggttttt tttttttttg agatggagtc ttgctttgtc acaacctcca 2640 

cctcccaggt tcaagcgatt ctctttcttc agcctcctga gtagctggga ctagaggcac 2700

accaccatgc ccggctaatt tttg 2724 



<210> 56 
<211> 2963 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 3029920CB1 
<400> 56 

ggccgaagag gctggcaggt ggcgccgtgg ggtgggtgct cctggtgaga ggagtccact 60

ccgtgcgtgc gggcggaggc cggcccccga gagccgccga catgaagaaa gacgtgcgga 120 

tcctgctggt gggagaacct agagttggga agacatcact gattatgtct ctggtcagtg 180 

aagaatttcc agaagaggtt cctccccggg cagaagaaat caccattcca gctgatgtca 240

ccccagagag agttccaaca cacattgtag attactcaga agcagaacag agtgatgaac 300 

aacttcatca agaaatatct caggctaatg tcatctgtat agtgtatgcc gttaacaaca 360

agcattctat tgataaggta acaagtcgat ggattcctct cataaatgaa agaacagaca 420 

aagacagcag gctgccttta atattggttg ggaacaaatc tgatctggtg gaatatagta 480 

gtatggagac catccttcct attatgaacc agtatacaga aatagaaacc tgtgtggagt 540

gttcagcgaa aaacctgaag aacatatcag agctctttta ttacgcacag aaagctgttc 600 

ttcatcctac agggcccctg tactgcccag aggagaagga gatgaaacca gcttgtataa 660 

aagcccttac tcgtatattt aaaatatctg atcaagataa tgatggtact ctcaatgatg 720 

ctgaactcaa cttctttcag aggatttgtt tcaacactcc attagctcct caagctctgg 780 

aggatgtcaa gaatgtagtc agaaaacata taagtgatgg tgtggctgac agtgggttga 840

ccctgaaagg ttttctcttt ttacacacac tttttatcca gagagggaga cacgaaacta 900

cttggactgt gcttcgacga tttggttatg atgatgacct ggatttgaca cctgaatatt 960 
tgttccccct gctgaaaata cctcctgatt gcactactga attaaatcat catgcatatt 1020
tatttctcca aagcaccttt gacaagcatg atttggatag agactgtgct ttgtcacctg 1080
atgagcttaa agatttattt aaagttttcc cttacatacc ttgggggcca gatgtgaata 1140
acacagtttg taccaatgaa agaggctgga taacctacca gggattcctt tcccagtgga 1200
cgctcacgac ttatttagat gtacagcggt gcctggaata tttgggctat ctaggctatt 1260
caatattgac tgagcaagag tctcaagctt cagctgttac agtgacaaga gataaaaaga 1320
tagacctgca gaaaaaacaa actcaaagaa atgtgttcag atgtaatgta attggagtga 1380
aaaactgtgg gaaaagtgga gttcttcagg ctcttcttgg aagaaactta atgaggcaga 1440
agaaaattcg tgaagatcat aaatcctact atgcgattaa cactgtttat gtatatggac 1500
aagagaaata cttgttgttg catgatatct cagaatcgga atttctaact gaagctgaaa 1560
tcatttgtga tgttgtatgc ctggtatatg atgtcagcaa tcccaaatcc tttgaatact 1620
gtgccaggat ttttaagcaa cactttatgg acagcagaat accttgctta atcgtagctg 1680
caaagtcaga cctgcatgaa gttaaacaag aatacagtat ttcacctact gatttctgca 1740
ggaaacacaa aatgcctcca ccacaagcct tcacttgcaa tactgctgat gcccccagta 1800
aggatatctt tgttaaattg acaacaatgg ccatgtatcc gcacgtgaca caagctgacc 1860
tcaagagctc cacgttttgg cttcgagcaa gttttggtgc tactgttttt gcagttttgg 1920
gctttgctat gtacaaagca ttattgaaac agcgatgata taaaaagaaa tactgtccct 1980
accaaaaaca aatactttta tgtacattct gaatgcttta agttctgcta gaattattga 2040
gatatttata catgcagagt tactttatta atatttgtaa ttcatgcata agagtatttt 2100
aatgatagtt ataactgcag tattggctag catatggaaa gaaaacagct aacagccaaa 2160
ctaaaatggc taaattccag aggccaaaag ggaatatttt gtaaatatat gtacatattc 2220
aggcaagata tggtctccca agctgagttc tagaaatgat gtttctagac atttctaagt 2280
ggtattgtta gtgctcactt ggctcactct tctaggttta agttagccca gagattgtat 2340
ttactcatgg atcactttat ttatttcaca tttactcaga atgatccttt gggttctata 2400
aggacataag gtacaatttg ccattgtctc tccattttta aaaacataca agtcagtgtc 2460
agcttaccaa catgacattt tttcagtcag ttgtggtagg ccagccttga agccatcgca 2520
cagtctagaa acttgtgtag ctgagtgtgc agctcacctt taagggtgaa gttaggtaaa 2580
agcaattagc agaggcgtta tctatgtgat tatgttgctt ccttgtcagt atgttgaatt 2640
ttatagccct ttcaatgaaa taaaaaaaaa atttgtatat taccaatgtt tttagtttaa 2700
ataaagagtc acccttacta ctgttgaatt tcatcccaag tgtaaatcat tctataatgg 2760
ctgtgtctgt tatagtatat tacagtaact gcatgtgtca ccaagtgttc tatatcaggc 2820
taggataacc tagaggcagt aattttttaa atgataaaat aaatctaatg aatataaact 2880
ctcatgataa acctattttt tccatcatca gccttttcaa gtatttaaat aaataactgc 2940
tgtgtactgt gaaaaaaaaa aaa 2963



<210> 57 

<211> 3332 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 3332415CB1 

<400> 57 

gcctggcaga ggctggcggg catcgtgccc gtccctgccg gtctcccggg cacccggcca 60
ccgccccacc ccctcctccc tgccatggag cccgagctgg acgctcagaa gcagcctcga 120
ccgcggaggc gaagccgccg ggcctctggg ctcagcacgg agggagcgac ggggccttcg 180
gccgacacct ccgggtcgga gctggacggg agatgttccc ttcggagagg cagctccttc 240
acattcttaa cacctggccc caactgggac ttcactttga aaagaaaacg cagagagaaa 300
gatgatgatg ttgtaagcct tagcagcctt gatctgaagg agccaagcaa taaaagagtt 360
cgacctctgg ctcgtgtcac gtccttggca aatttaatct ctcctgtaag aaatggagct 420
gtcagacgtt ttggtcaaac aatacagtca tttacccttc gtggtgacca cagatcccca 480
gcctctgccc agaagttttc tagcaggtca acagtcccaa cacccgccaa gagaaggagc 540
agtgcactgt ggtcagagat gctggacatc accatgaagg agtctctcac caccagggag 600
atcagacggc aggaggcaat acatgaaatg tcccgaggtg aacaggattt aattgaggat 660 
ctcaaacttg caagaaaggc ctaccatgac cccatgttaa agttgtccat catgtcagaa 720 
gaggaactca cacatacatt tggtgatctg gactcttaca tacctctgca tgaagatttg 780
ttgacaagaa taggagaagc aaccaagcct gatggaacag tggagcagat tggtcacatt 840 
ctcgtgagct ggttaccgcg cttgaatgcc tacagaggtt actgtagtaa ccagctggca 900 
gccaaagctc ttcttgatca aaagaaacag gatccaagag tccaagactt cctccagcga 960 
tgtctcgagt ctcccttcag tcgaaaacta gatctttgga gtttcctaga tatccctcga 1020 
agtcgcctag tcaaataccc tttactgtta aaagaaattc ttaaacacac tccaaaagag 1080 
caccctgatg ttcagcttct ggaggatgct atattgataa tacagggagt cctctctgat 1140
atcaacttga agaaaggtga atccgagtgc cagtattaca tcgacaagct ggagtacctg 1200 
gatgaaaagc agagggaccc cagaatcgaa gcgagcaaag tgctgctgtg ccatggggag 1260 
ctgcggagca agagtggaca taaactttac attttcctgt ttcaagacat cttggttctg 1320 
actcggcccg tcacacggaa cgaacggcac tcttaccagg tttaccggca gccaatccca 1380 
gtccaagagc tagtcctaga agacctgcag gatggagatg tgagaatggg aggctccttt 1440 
cgaggagctt tcagtaactc agagaaagct aaaaatatct ttagaattcg cttccatgac 1500 
ccctctccag cccagtctca cactctgcaa gccaatgacg tgttccacaa gcagcagtgg 1560 
ttcaactgta ttcgagcggc cattgccccc ttccagtcgg caggcagtcc acctgagctg 1620 
cagggcctgc cggagctgca cgaagagtgt gaggggaacc acccctctgc gaggaaactc 1680 
acagcccaga ggagggcatc cacagtttcc agtgttactc aggtagaagt tgatgaaaac 1740 
gcttacagat gtggctctgg catgcagatg gcagaggaca gcaagagctt aaagacacac 1800 
cagacacagc ccggcatccg aagagcgagg gacaaagccc tttctggtgg caaacggaaa 1860
gagactttgg tgtagagaag gctctgtgtg ttaactgatg ggagagactg tttgtttata 1920 
aatgtgtaca gttttgtttt ctcgtaaggg gagcatcata gggttacttt ataccagttg 1980 
taacattttc attgtttttg gttgttcttt tttctttttt taatggcagc taaagatata 2040 
cagattactg ttaaattgca gtcctttttt ttttaaagat attttcttga gttatttaga 2100 
acatggtaag cctggtattt tttaatcaaa caaaatattt atgaaatggg ttttctctta 2160 
attctggatt catcatggct ttctaatacc aattgtaata tttacaatat tcaccaaaac 2220 
ttagaatttt gcaaatgctg gaattctgcc agtgtttctt tgctaagcct tgcatgcaaa 2280 
atttgaaatt ttaacattgg cacccaaaac ctacatggaa tgtatgtctg gagtatttca 2340 
aactttacat tgaaacataa tttccttgga aaacaaacca taagcctgag gaggttttta 2400 
tcaactggaa tgctttatat tagtttgttt ttcactgtac attcctcatt ttacattcat 2460
ttaacctgcc gattatttaa tttttttatt gtaaagtagt ttttagcatt tgcttttatt 2520 
tttttacttt gatgcctttt caaattggca tgtctttaaa gtatttttct tcctgattaa 2580 
aaatgtgtgt gtatgtgtgt gtgtgtgtgt atatatatat atttttttaa atcacattaa 2640 
ttttaccaag tgaaaccaag ccatactgtt tttgagccaa ttaagaaaat tgccattttt 2700 
aaagtgtagc atttcagggt aaagacccat gaaatggctt gatgtattct agactactga 2760 
aagaaaacca cttcaaagat tttgttgaaa gttttagtgt tgtctgaaat gcaagaggga 2820 
aggtgattgg tagtgagtta aaagaaaaag agaggaaaag agagtagttt tgtcttcaag 2880 
taaaatgtct ggttgtgcca gacatttcac aagtgtgaaa ggagatagga gaagctcaac 2940 
ttgagggcgt gtagtaagtt gtagaaggct cgaggggacg tggacttatt tgccttggtt 3000 
tgcaatacct gcaaataatg agtttgaaaa gaaacaatga aatgtgttaa aaatttgacc 3060 
atattagata aattttggtg gatttagtca taagatggaa aaagactggt gaatctttta 3120 
ttacaaaatg tttctgttaa aatgggatca tcatctttga aaggggggag gaggagtaaa 3180 
agcccgatta taatggtgat caattcaagt cagtgttgac tattctgtga aatatatttg 3240
gccagtggaa atgataatca gaaaagactg taaatagatc catccaaatg atttctctgt 3300 
acaaatgaat gatactatta aaaaaaaaaa aa 3332 



<210> 58 

<211> 2617

<212> DNA

<213> Homo sapiens 

<220> 
<221> misc_feature

<223> Incyte ID No: 4031536CB1
<400> 58 

tttagtaatg tgcctgtatt acatgtagag agtattcgtc aaccaagagg agttttaaaa 60
tgtcaaaacc gggaaaacct actctaaacc atggcttggt tcctgttgat cttaaaagtg 120
caaaagagcc tctaccacat caaactgtga tgaggatatt tagcattagc atcattgccc 180
aaggcctccc tttttgtcga agacggatga aaagaaagtt ggaccatggt tctgaggtcc 240
gctctttttc tttgggaaag aaaccatgca aagtctcaga atatacaagt accactgggc 300
ttgtaccatg ttcagcaaca ccaacaactt ttggggacct cagagcagcc aatggccaag 360
ggcaacaacg acgccgaatt acatctgtcc agccacctac aggcctccag gaatggctaa 420
aaatgtttca gagctggagt ggaccagaga aattgcttgc tttagatgaa ctcattgata 480
gttgtgaacc aacacaagta aaacatatga tgcaagtgat agaaccccag tttcaacgag 540
acttcatttc attgctccct aaagagttgg cactctatgt gctttcattc ctggaaccca 600
aagacctgct acaagcagct cagacatgtc gctactggag aattttggct gaagacaacc 660
ttctctggag agagaaatgc aaagaagagg ggattgatga accattgcac atcaagagaa 720
gaaaagtaat aaaaccaggt ttcatacaca gtccatggaa aagtgcatac atcagacagc 780
acagaattga tactaactgg aggcgaggag aactcaaatc tcctaaggtg ctgaaaggac 840
atgatgatca tgtgatcaca tgcttacagt tttgtggtaa ccgaatagtt agtggttctg 900
atgacaacac tttaaaagtt tggtcagcag tcacaggcaa atgtctgaga acattagtgg 960
gacatacagg tggagtatgg tcatcacaaa tgagagacaa catcatcatt agtggatcta 1020
cagatcggac actcaaagtg tggaatgcag agactggaga atgtatacac accttatatg 1080
ggcatacttc cactgtgcgt tgtatgcatc ttcatgaaaa aagagttgtt agcggttctc 1140
gagatgccac tcttagggtt tgggatattg agacaggcca gtgtttacat gttttgatgg 1200
gtcatgttgc agcagtccgc tgtgttcaat atgatggcag gagggttgtt agtggagcat 1260
atgattttat ggtaaaggtg tgggatccag agactgaaac ctgtctacac acgttgcagg 1320
ggcatactaa tagagtctat tcattacagt ttgatggtat ccatgtggtg agtggatctc 1380
ttgatacatc aatccgtgtt tgggatgtgg agacagggaa ttgcattcac acgttaacag 1440
ggcaccagtc gttaacaagt ggaatggaac tcaaagacaa tattcttgtc tctgggaatg 1500
cagattctac agttaaaatc tgggatatca aaacaggaca gtgtttacaa acattgcaag 1560
gtcccaacaa gcatcagagt gctgtgacct gtttacagtt caacaagaac tttgtaatta 1620
ccagctcaga tgatggaact gtaaaactat gggacttgaa aacgggtgaa tttattcgaa 1680
acctagtcac attggagagt ggggggagtg ggggagttgt gtggcggatc agagcctcaa 1740
acacaaagct ggtgtgtgca gttgggagtc ggaatgggac tgaagaaacc aagctgctgg 1800
tgctggactt tgatgtggac atgaagtgaa gagcagaaaa gatgaatttg tccaattgtg 1860
tagacgatat actccctgcc cttccccctg caaaaagaaa aaaagaaaag aaaaagaaaa 1920
aaatcccttg ttctcagtgg tgcaggatgt tggcttgggg caacagattg aaaagaccta 1980
cagactaaga aggaaaagaa gaagagatga caaaccataa ctgacaagag aggcgtctgc 2040
tgtctcatca cataaaaggc ttcacttttg actgagggca gctttgcaaa atgagacttt 2100
ctaaatcaaa ccaggtgcaa ttatttcttt attttcttct ccagtggtca ttgggcagtg 2160
ttaatgctga aacatcatta cagattctgc tagcctgttc ttttaccact gacagctaga 2220
cacctagaaa ggaactgcaa taatatcaaa acaagtactg gttgactttc taattagaga 2280
gcatctgcaa caaaaagtca tttttctgga gtggaaaagc ttaaaaaaat tactgtgaat 2340
tgtttttgta cagttatcat gaaaagcttt tttttttttt tttgccaacc attgccaatg 2400
tcaatcaatc acagtattag cctctgttaa tctatttact gttgcttcca tatacattct 2460
tcaatgcata tgttgctcaa aggtggcaag ttgtcctggg ttctgtgagt cctgagatgg 2520
atttaattct tgatgctggt gctagaagta ggtcttcaaa tatgggattg ttgtcccaac 2580
cctgtactgt actcccagtg gccaaactta tttatgc 2617